Measured performance
Provider & model performance
Measured time-to-first-token, time-to-first-byte, throughput, and uptime for every LLM provider and model TrustedRouter routes to — continuously sampled, not vendor-claimed.
Last updated: 2026-06-09T13:26:14Z
Figures are sampled continuously from TrustedRouter's monitor regions over the 5,000-sample benchmark set.
Time-to-first-token (TTFT), time-to-first-byte (TTFB), throughput, and success rate are measured on real
streaming requests rather than taken from vendor claims. Unsupported-route and probe-configuration rows are
reported separately under "Config excluded" and do not count as provider downtime. No prompt or output
content is ever stored.
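The timing metrics named above can be illustrated with a small sketch. The page does not publish its exact definitions, so the ones used here are assumptions: TTFB is the delay until the first streamed chunk of any kind arrives, TTFT is the delay until the first chunk that carries generated text, and throughput is output tokens divided by the time between the first text chunk and the last chunk. `StreamTiming` and `summarize_stream` are hypothetical names, not part of any TrustedRouter API.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass
class StreamTiming:
    ttfb_ms: Optional[float]       # time to first chunk of any kind (assumed definition)
    ttft_ms: Optional[float]       # time to first chunk with non-empty text (assumed definition)
    tokens_per_s: Optional[float]  # output tokens / generation time (assumed definition)


def summarize_stream(
    request_start_s: float,
    chunks: Sequence[Tuple[float, str]],
    output_tokens: int,
) -> StreamTiming:
    """Summarize one streamed response.

    chunks holds (arrival_time_s, text) pairs in arrival order; text may be
    empty for keep-alive or metadata-only chunks.
    """
    if not chunks:
        # Nothing arrived at all: no timing can be derived.
        return StreamTiming(None, None, None)

    ttfb_ms = (chunks[0][0] - request_start_s) * 1000.0

    # First chunk that actually carries generated text.
    first_token_s = next((t for t, text in chunks if text), None)
    if first_token_s is None:
        # The stream opened but produced no text.
        return StreamTiming(ttfb_ms, None, None)

    ttft_ms = (first_token_s - request_start_s) * 1000.0
    generation_s = chunks[-1][0] - first_token_s
    tokens_per_s = (
        output_tokens / generation_s
        if generation_s > 0 and output_tokens > 0
        else None
    )
    return StreamTiming(ttfb_ms, ttft_ms, tokens_per_s)
```

Under these assumed definitions TTFB can never exceed TTFT for the same request, and a stream that opens but never yields text has a TTFB but no TTFT or throughput.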
Providers
Ranked by measured p50 (median) time-to-first-token across all of a provider's models in the 5,000-sample benchmark set (22 providers · 2,770 samples).
| # | Provider | Models | p50 TTFT | Throughput | Uptime | Errors | Config excluded | Samples |
|---|---|---|---|---|---|---|---|---|
| 1 | mistral | 8 | 811 ms | — | 100.00% | — | — | 131 |
| 2 | lightning | 1 | 885 ms | — | 100.00% | — | — | 125 |
| 3 | deepinfra | 7 | 945 ms | — | 99.28% | provider_error 1% | — | 139 |
| 4 | grok | 2 | 985 ms | — | 100.00% | — | — | 144 |
| 5 | cerebras | 4 | 992 ms | — | 91.06% | provider_error 9% | — | 123 |
| 6 | parasail | 24 | 1029 ms | — | 85.19% | provider_error 15% | 3 probe_config_error | 108 |
| 7 | gemini | 5 | 1060 ms | — | 100.00% | — | 12 probe_config_error | 73 |
| 8 | together | 3 | 1178 ms | — | 98.45% | provider_error 2% | — | 129 |
| 9 | phala | 18 | 1206 ms | — | 96.77% | provider_error 3% | — | 124 |
| 10 | openai | 11 | 1221 ms | — | 100.00% | — | — | 135 |
| 11 | tinfoil | 5 | 1258 ms | — | 84.06% | provider_error 16% | — | 138 |
| 12 | venice | 11 | 1354 ms | — | 98.55% | provider_error 1% | — | 138 |
| 13 | zai | 11 | 1494 ms | — | 96.80% | provider_error 3% | — | 125 |
| 14 | novita | 71 | 1504 ms | — | 92.48% | provider_error 8% | — | 133 |
| 15 | deepseek | 2 | 1540 ms | — | 100.00% | — | — | 130 |
| 16 | nebius | 24 | 1562 ms | — | 100.00% | — | 1 probe_config_error | 107 |
| 17 | minimax | 6 | 1662 ms | — | 100.00% | — | — | 137 |
| 18 | kimi | 2 | 1691 ms | — | 93.10% | provider_error 7% | — | 116 |
| 19 | anthropic | 10 | 1717 ms | 45 tok/s | 100.00% | — | — | 126 |
| 20 | siliconflow | 7 | 1882 ms | — | 90.98% | provider_error 9% | — | 122 |
| 21 | xiaomi | 5 | 2360 ms | 378 tok/s | 100.00% | — | — | 140 |
| 22 | gmi | 5 | 3126 ms | — | 84.25% | empty_stream 16% | — | 127 |
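How the Uptime, Config excluded, and Samples columns relate can be sketched as follows. This is a minimal illustration and not TrustedRouter's implementation. It assumes that configuration rows are removed before uptime is computed, and that uptime is the share of remaining samples that succeeded. The function name, the tuple layout, and the `"ok"` and `"unsupported_route"` labels are hypothetical; `probe_config_error` and `provider_error` are the labels shown in the table.

```python
from collections import defaultdict
from statistics import median

# Outcomes treated as configuration problems rather than provider downtime.
# "probe_config_error" appears in the table; "unsupported_route" is a hypothetical label.
CONFIG_OUTCOMES = {"probe_config_error", "unsupported_route"}


def provider_summary(samples):
    """Aggregate (provider, outcome, ttft_ms_or_None) tuples into ranked rows."""
    counted = defaultdict(list)  # provider -> [(outcome, ttft_ms)] counted toward uptime
    excluded = defaultdict(int)  # provider -> number of config-excluded rows
    for provider, outcome, ttft_ms in samples:
        if outcome in CONFIG_OUTCOMES:
            excluded[provider] += 1
        else:
            counted[provider].append((outcome, ttft_ms))

    rows = []
    for provider in set(counted) | set(excluded):
        results = counted.get(provider, [])
        ok_ttfts = [ttft for outcome, ttft in results if outcome == "ok"]
        rows.append({
            "provider": provider,
            "samples": len(results),
            "uptime_pct": round(100.0 * len(ok_ttfts) / len(results), 2) if results else None,
            "p50_ttft_ms": median(ok_ttfts) if ok_ttfts else None,
            "config_excluded": excluded.get(provider, 0),
        })

    # Fastest median TTFT first; providers with no successful sample sort last.
    rows.sort(key=lambda r: (r["p50_ttft_ms"] is None, r["p50_ttft_ms"] or 0.0))
    return rows
```

Under this reading, the deepinfra row (139 samples, 99.28% uptime) is consistent with 138 successes and one `provider_error`, and the gemini row keeps 100.00% uptime because its 12 `probe_config_error` rows never enter the denominator.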
Models
Models sampled in the 5,000-sample benchmark set, fastest measured TTFT first. Rows with few samples are marked "limited data" after the model name; their figures will sharpen as more data arrives.
| # | Model | Provider | p50 TTFT | p95 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|---|---|---|
| 1 | nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 limited data | nebius | 458 ms | 575 ms | 457 ms | — | 100.00% | — | 2 |
| 2 | qwen/qwen3.5-27b | deepinfra | 470 ms | 1021 ms | 469 ms | — | 100.00% | — | 21 |
| 3 | qwen/qwen3-next-80b-a3b-instruct limited data | parasail | 636 ms | 652 ms | 634 ms | — | 100.00% | — | 3 |
| 4 | mistralai/mistral-small-2603 limited data | mistral | 661 ms | 1472 ms | 659 ms | — | 100.00% | — | 12 |
| 5 | meta-llama/llama-3-70b-instruct limited data | novita | 695 ms | 1303 ms | 693 ms | — | 100.00% | — | 2 |
| 6 | qwen/qwen3-vl-30b-a3b-instruct limited data | phala | 695 ms | 8274 ms | 693 ms | — | 100.00% | — | 6 |
| 7 | qwen/qwen2.5-vl-72b-instruct limited data | phala | 711 ms | 5230 ms | 709 ms | — | 100.00% | — | 9 |
| 8 | mistralai/mistral-small-3.2-24b-instruct limited data | parasail | 715 ms | 1161 ms | 714 ms | — | 100.00% | — | 4 |
| 9 | thedrummer/cydonia-24b-v4.1 limited data | parasail | 715 ms | 875 ms | 713 ms | — | 100.00% | — | 3 |
| 10 | mistralai/ministral-8b-2512 limited data | mistral | 721 ms | 1157 ms | 720 ms | — | 100.00% | — | 19 |
| 11 | Sao10K/L3-8B-Stheno-v3.2 limited data | novita | 728 ms | 728 ms | 726 ms | — | 100.00% | — | 1 |
| 12 | mistralai/mistral-medium-3-5 limited data | mistral | 771 ms | 1060 ms | 769 ms | — | 100.00% | — | 19 |
| 13 | google/gemma-3-27b-it limited data | phala | 800 ms | 1312 ms | 788 ms | — | 100.00% | — | 7 |
| 14 | mistralai/ministral-3b-2512 limited data | mistral | 811 ms | 1286 ms | 769 ms | — | 100.00% | — | 17 |
| 15 | openai/gpt-oss-120b limited data | parasail | 814 ms | 1289 ms | 811 ms | — | 100.00% | — | 4 |
| 16 | meta-llama/llama-4-maverick limited data | parasail | 815 ms | 1060 ms | 712 ms | — | 100.00% | — | 4 |
| 17 | google/gemma-3-27b-it limited data | novita | 828 ms | 828 ms | 826 ms | — | 100.00% | — | 1 |
| 18 | mistralai/mistral-large limited data | mistral | 830 ms | 1909 ms | 828 ms | — | 100.00% | — | 12 |
| 19 | meta-llama/Llama-3.3-70B-Instruct limited data | nebius | 830 ms | 11370 ms | 829 ms | — | 100.00% | — | 4 |
| 20 | cerebras/gpt-oss-120b | cerebras | 839 ms | 2480 ms | 759 ms | — | 80.77% | — | 26 |
| 21 | qwen/qwen-2.5-7b-instruct limited data | phala | 845 ms | 1485 ms | 742 ms | — | 83.33% | — | 12 |
| 22 | deepseek/deepseek-v3.2 limited data | parasail | 846 ms | 3370 ms | 844 ms | — | 100.00% | — | 5 |
| 23 | mistralai/mistral-small-3.2-24b-instruct limited data | mistral | 847 ms | 1107 ms | 768 ms | — | 100.00% | — | 18 |
| 24 | mistralai/ministral-14b-2512 limited data | mistral | 854 ms | 1687 ms | 751 ms | — | 100.00% | — | 19 |
| 25 | Qwen/Qwen2.5-VL-72B-Instruct limited data | nebius | 859 ms | 7011 ms | 858 ms | — | 100.00% | — | 5 |
| 26 | qwen/qwen-2.5-7b-instruct | together | 861 ms | 1224 ms | 763 ms | — | 100.00% | — | 38 |
| 27 | openai/gpt-4o limited data | openai | 864 ms | 6080 ms | 780 ms | — | 100.00% | — | 15 |
| 28 | z-ai/glm-5.1 limited data | parasail | 867 ms | 1486 ms | 864 ms | — | 100.00% | — | 3 |
| 29 | bytedance/ui-tars-1.5-7b limited data | parasail | 869 ms | 1294 ms | 765 ms | — | 100.00% | — | 5 |
| 30 | google/gemma-4-26b-a4b-it limited data | deepinfra | 876 ms | 7029 ms | 820 ms | — | 100.00% | — | 13 |
| 31 | openai/gpt-4.1-mini limited data | openai | 883 ms | 7254 ms | 880 ms | — | 100.00% | — | 8 |
| 32 | Qwen/Qwen3-32B limited data | nebius | 883 ms | 1796 ms | 881 ms | — | 100.00% | — | 4 |
| 33 | google/gemma-4-31b-it | lightning | 885 ms | 1408 ms | 783 ms | — | 100.00% | — | 125 |
| 34 | thedrummer/skyfall-36b-v2 limited data | parasail | 888 ms | 899 ms | 788 ms | — | 100.00% | — | 3 |
| 35 | z-ai/glm-4.7-flash limited data | venice | 893 ms | 2757 ms | 891 ms | — | 100.00% | — | 8 |
| 36 | mistralai/mistral-nemo limited data | mistral | 895 ms | 1737 ms | 791 ms | — | 100.00% | — | 15 |
| 37 | google/gemini-3.1-flash-lite-preview | gemini | 899 ms | 1333 ms | 897 ms | — | 100.00% | — | 20 |
| 38 | NousResearch/Hermes-4-70B limited data | nebius | 905 ms | 1211 ms | 902 ms | — | 100.00% | — | 4 |
| 39 | openai/gpt-oss-120b | cerebras | 912 ms | 10113 ms | 808 ms | — | 73.91% | — | 23 |
| 40 | sao10k/l3-8b-lunaris limited data | novita | 913 ms | 913 ms | 808 ms | — | 100.00% | — | 1 |
| 41 | meta-llama/llama-3.3-70b-instruct | tinfoil | 921 ms | 1337 ms | 818 ms | — | 100.00% | — | 23 |
| 42 | qwen/qwen3-30b-a3b-instruct-2507 limited data | phala | 930 ms | 1792 ms | 827 ms | — | 100.00% | — | 4 |
| 43 | google/gemma-3-4b-it | deepinfra | 933 ms | 6841 ms | 830 ms | — | 100.00% | — | 25 |
| 44 | openai/gpt-4.1-nano limited data | openai | 936 ms | 6064 ms | 833 ms | — | 100.00% | — | 15 |
| 45 | Qwen/Qwen3-235B-A22B-Instruct-2507 limited data | nebius | 944 ms | 1160 ms | 943 ms | — | 100.00% | — | 4 |
| 46 | google/gemma-4-31b-it limited data | deepinfra | 945 ms | 2366 ms | 843 ms | — | 93.75% | — | 16 |
| 47 | meta-llama/llama-3.1-8b-instruct limited data | novita | 972 ms | 1202 ms | 971 ms | — | 100.00% | — | 2 |
| 48 | openai/gpt-oss-120b | tinfoil | 976 ms | 1336 ms | 974 ms | — | 100.00% | — | 24 |
| 49 | meta-llama/llama-3.1-70b-instruct | deepinfra | 977 ms | 1253 ms | 874 ms | — | 100.00% | — | 23 |
| 50 | google/gemma-3-27b-it | deepinfra | 978 ms | 2105 ms | 889 ms | — | 100.00% | — | 24 |
| 51 | qwen/qwen3.5-9b limited data | venice | 979 ms | 1412 ms | 876 ms | — | 85.71% | — | 14 |
| 52 | meta-llama/llama-3.3-70b-instruct limited data | novita | 982 ms | 1258 ms | 980 ms | — | 100.00% | — | 2 |
| 53 | x-ai/grok-4.20 | grok | 985 ms | 3208 ms | 887 ms | — | 100.00% | — | 73 |
| 54 | qwen/qwen3-coder-next limited data | parasail | 986 ms | 1082 ms | 883 ms | — | 100.00% | — | 3 |
| 55 | meta-llama/llama-4-scout-17b-16e-instruct limited data | novita | 988 ms | 988 ms | 885 ms | — | 100.00% | — | 1 |
| 56 | cerebras/zai-glm-4.7 | cerebras | 992 ms | 2284 ms | 975 ms | — | 100.00% | — | 33 |
| 57 | google/gemma-3-12b-it limited data | deepinfra | 992 ms | 1435 ms | 890 ms | — | 100.00% | — | 17 |
| 58 | google/gemma-4-26b-a4b-it limited data | parasail | 1000 ms | 3413 ms | 897 ms | — | 100.00% | — | 6 |
| 59 | qwen/qwen3-vl-8b-instruct limited data | parasail | 1001 ms | 1443 ms | 898 ms | — | 100.00% | — | 5 |
| 60 | openai/gpt-oss-120b limited data | nebius | 1015 ms | 1439 ms | 1014 ms | — | 100.00% | — | 4 |
| 61 | qwen/qwen3-vl-235b-a22b-instruct limited data | parasail | 1029 ms | 2340 ms | 1027 ms | — | 100.00% | — | 4 |
| 62 | meta-llama/llama-3.3-70b-instruct limited data | parasail | 1041 ms | 1473 ms | 1039 ms | — | 100.00% | — | 4 |
| 63 | inclusionai/ling-2.6-flash limited data | novita | 1045 ms | 1607 ms | 1044 ms | — | 100.00% | — | 2 |
| 64 | z-ai/glm-4.7 | cerebras | 1052 ms | 2824 ms | 986 ms | — | 100.00% | — | 41 |
| 65 | google/gemini-2.5-flash-lite limited data | gemini | 1055 ms | 1242 ms | 954 ms | — | 100.00% | — | 13 |
| 66 | zai-org/autoglm-phone-9b-multilingual limited data | novita | 1057 ms | 1057 ms | 1056 ms | — | 100.00% | — | 1 |
| 67 | google/gemini-2.5-flash limited data | gemini | 1060 ms | 1493 ms | 1026 ms | — | 100.00% | — | 16 |
| 68 | openai/gpt-oss-120b limited data | phala | 1070 ms | 5625 ms | 965 ms | — | 100.00% | — | 13 |
| 69 | google/gemma-3-27b-it limited data | parasail | 1081 ms | 1278 ms | 976 ms | — | 100.00% | — | 3 |
| 70 | NousResearch/Hermes-4-405B limited data | nebius | 1089 ms | 1214 ms | 1087 ms | — | 100.00% | — | 2 |
| 71 | qwen/qwen3-vl-30b-a3b-instruct limited data | novita | 1093 ms | 1093 ms | 989 ms | — | 100.00% | — | 1 |
| 72 | openai/gpt-4.1 limited data | openai | 1097 ms | 7417 ms | 994 ms | — | 100.00% | — | 14 |
| 73 | qwen/qwen3-235b-a22b-thinking-2507 limited data | venice | 1102 ms | 2494 ms | 998 ms | — | 100.00% | — | 13 |
| 74 | openai/gpt-oss-120b limited data | novita | 1109 ms | 1537 ms | 1006 ms | — | 100.00% | — | 2 |
| 75 | openai/gpt-oss-20b limited data | parasail | 1110 ms | 1255 ms | 1007 ms | — | 100.00% | — | 2 |
| 76 | qwen/qwen3.6-27b limited data | venice | 1114 ms | 2727 ms | 1012 ms | — | 100.00% | — | 12 |
| 77 | anthropic/claude-haiku-4.5 limited data | anthropic | 1119 ms | 1426 ms | 1015 ms | — | 100.00% | — | 9 |
| 78 | minimaxai/minimax-m1-80k limited data | novita | 1132 ms | 2188 ms | 1131 ms | — | 100.00% | — | 2 |
| 79 | qwen/qwen3.6-27b limited data | novita | 1137 ms | 1732 ms | 1135 ms | — | 100.00% | — | 2 |
| 80 | inclusionai/ling-2.6-1t limited data | novita | 1156 ms | 8026 ms | 1155 ms | — | 100.00% | — | 4 |
| 81 | google/gemini-3.1-flash-lite limited data | gemini | 1163 ms | 1646 ms | 1060 ms | — | 100.00% | — | 13 |
| 82 | qwen/qwen3-vl-30b-a3b-thinking limited data | novita | 1172 ms | 1997 ms | 1170 ms | — | 100.00% | — | 3 |
| 83 | google/gemma-4-26b-a4b-it limited data | novita | 1174 ms | 1864 ms | 1072 ms | — | 100.00% | — | 2 |
| 84 | moonshotai/kimi-k2.6 | together | 1178 ms | 8385 ms | 1176 ms | — | 95.74% | — | 47 |
| 85 | z-ai/glm-5-turbo limited data | zai | 1178 ms | 2481 ms | 1177 ms | — | 100.00% | — | 6 |
| 86 | google/gemma-3-27b-it limited data | nebius | 1181 ms | 1285 ms | 1107 ms | — | 100.00% | — | 5 |
| 87 | meta-llama/llama-3.3-70b-instruct | together | 1188 ms | 2377 ms | 1084 ms | — | 100.00% | — | 44 |
| 88 | anthropic/claude-opus-4.7 limited data | anthropic | 1190 ms | 1534 ms | 1188 ms | 45 tok/s | 100.00% | — | 16 |
| 89 | openai/gpt-5.4-mini limited data | openai | 1202 ms | 3606 ms | 1114 ms | — | 100.00% | — | 13 |
| 90 | qwen/qwen3.5-27b limited data | phala | 1206 ms | 4082 ms | 1132 ms | — | 100.00% | — | 11 |
| 91 | x-ai/grok-4.3 | grok | 1207 ms | 2256 ms | 1105 ms | — | 100.00% | — | 71 |
| 92 | qwen/qwen3-vl-235b-a22b-thinking limited data | novita | 1212 ms | 1977 ms | 1108 ms | — | 100.00% | — | 3 |
| 93 | z-ai/glm-4.5 limited data | zai | 1213 ms | 2619 ms | 1211 ms | — | 100.00% | — | 12 |
| 94 | moonshotai/Kimi-K2.5-fast limited data | nebius | 1213 ms | 1213 ms | 1212 ms | — | 100.00% | — | 1 |
| 95 | openai/gpt-4o-mini limited data | openai | 1221 ms | 3383 ms | 1117 ms | — | 100.00% | — | 13 |
| 96 | qwen/qwen2.5-vl-72b-instruct limited data | parasail | 1225 ms | 11062 ms | 1123 ms | — | 70.00% | — | 10 |
| 97 | qwen/qwen-mt-plus limited data | novita | 1232 ms | 1232 ms | 1129 ms | — | 100.00% | — | 1 |
| 98 | qwen/qwen3.6-35b-a3b limited data | novita | 1235 ms | 1543 ms | 1232 ms | — | 100.00% | — | 2 |
| 99 | Qwen/Qwen3-30B-A3B-Instruct-2507 limited data | nebius | 1236 ms | 1498 ms | 1134 ms | — | 100.00% | — | 5 |
| 100 | deepseek/deepseek-prover-v2-671b limited data | novita | 1254 ms | 1254 ms | 1252 ms | — | 100.00% | — | 1 |
| 101 | deepseek/deepseek-v4-pro | tinfoil | 1258 ms | 5748 ms | 1195 ms | — | 100.00% | — | 31 |
| 102 | z-ai/glm-4.5-air:free limited data | zai | 1267 ms | 8345 ms | 1265 ms | — | 100.00% | — | 9 |
| 103 | z-ai/glm-5.1 limited data | venice | 1285 ms | 5733 ms | 1182 ms | — | 100.00% | — | 15 |
| 104 | z-ai/glm-4.5-air limited data | zai | 1288 ms | 4443 ms | 1286 ms | — | 100.00% | — | 16 |
| 105 | moonshotai/kimi-k2.5 limited data | novita | 1296 ms | 5895 ms | 1295 ms | — | 100.00% | — | 2 |
| 106 | zai-org/glm-4.5-air limited data | novita | 1303 ms | 1756 ms | 1201 ms | — | 100.00% | — | 3 |
| 107 | deepseek/deepseek-v4-flash | deepseek | 1337 ms | 2053 ms | 1256 ms | — | 100.00% | — | 62 |
| 108 | minimax/minimax-m2 limited data | minimax | 1339 ms | 2165 ms | 1265 ms | — | 100.00% | — | 18 |
| 109 | z-ai/glm-4.6 limited data | venice | 1354 ms | 1968 ms | 1340 ms | — | 100.00% | — | 11 |
| 110 | openai/o3 limited data | openai | 1366 ms | 1953 ms | 1270 ms | — | 100.00% | — | 13 |
| 111 | google/gemma-4-31b-it limited data | novita | 1394 ms | 3149 ms | 1289 ms | — | 100.00% | — | 3 |
| 112 | z-ai/glm-4.7 limited data | venice | 1396 ms | 3133 ms | 1388 ms | — | 100.00% | — | 16 |
| 113 | qwen/qwen3-omni-30b-a3b-instruct limited data | novita | 1398 ms | 1398 ms | 1295 ms | — | 100.00% | — | 1 |
| 114 | minimax/minimax-m2.5 limited data | novita | 1401 ms | 1401 ms | 1399 ms | — | 100.00% | — | 1 |
| 115 | xiaomi/mimo-v2-flash | xiaomi | 1416 ms | 2598 ms | 1415 ms | — | 100.00% | — | 24 |
| 116 | deepseek/deepseek-chat-v3.1 limited data | phala | 1417 ms | 2524 ms | 1415 ms | — | 100.00% | — | 5 |
| 117 | minimax/minimax-m2 limited data | novita | 1423 ms | 1423 ms | 1319 ms | — | 100.00% | — | 1 |
| 118 | google/gemma-4-31b-it limited data | parasail | 1426 ms | 19795 ms | 1425 ms | — | 75.00% | — | 8 |
| 119 | arcee-ai/trinity-large-thinking limited data | parasail | 1428 ms | 1590 ms | 1325 ms | — | 100.00% | — | 6 |
| 120 | moonshotai/kimi-k2.6 | tinfoil | 1433 ms | 2900 ms | 1382 ms | — | 100.00% | — | 25 |
| 121 | google/gemma-4-26b-a4b-it limited data | gmi | 1435 ms | 14298 ms | 1434 ms | — | 63.16% | — | 19 |
| 122 | qwen/qwen3-coder-480b-a35b-instruct limited data | novita | 1439 ms | 1879 ms | 1438 ms | — | 100.00% | — | 2 |
| 123 | moonshotai/kimi-k2.6 limited data | phala | 1445 ms | 4619 ms | 1392 ms | — | 100.00% | — | 6 |
| 124 | z-ai/glm-5 limited data | venice | 1448 ms | 1633 ms | 1344 ms | — | 100.00% | — | 15 |
| 125 | deepseek-ai/DeepSeek-V3.2 limited data | nebius | 1449 ms | 1747 ms | 1447 ms | — | 100.00% | — | 3 |
| 126 | z-ai/glm-4.7-flash limited data | phala | 1450 ms | 7565 ms | 1346 ms | — | 100.00% | — | 7 |
| 127 | z-ai/glm-4.5v limited data | zai | 1464 ms | 10717 ms | 1362 ms | — | 82.35% | — | 17 |
| 128 | anthropic/claude-opus-4.8 limited data | anthropic | 1465 ms | 2365 ms | 1464 ms | — | 100.00% | — | 9 |
| 129 | qwen/qwen3-next-80b-a3b-instruct limited data | novita | 1477 ms | 1850 ms | 1374 ms | — | 100.00% | — | 4 |
| 130 | microsoft/wizardlm-2-8x22b limited data | novita | 1478 ms | 1664 ms | 1476 ms | — | 100.00% | — | 2 |
| 131 | xiaomi/mimo-v2.5-pro-ultraspeed | xiaomi | 1482 ms | 2795 ms | 1662 ms | 378 tok/s | 100.00% | — | 38 |
| 132 | qwen/qwen3-235b-a22b-instruct-2507 limited data | novita | 1482 ms | 2366 ms | 1480 ms | — | 100.00% | — | 2 |
| 133 | openai/gpt-5.5 limited data | openai | 1486 ms | 2095 ms | 1383 ms | — | 100.00% | — | 11 |
| 134 | z-ai/glm-4.6v limited data | zai | 1494 ms | 8278 ms | 1400 ms | — | 100.00% | — | 15 |
| 135 | deepseek/deepseek-ocr-2 limited data | novita | 1495 ms | 2138 ms | 1391 ms | — | 100.00% | — | 4 |
| 136 | minimax/minimax-m2.7 | minimax | 1495 ms | 3621 ms | 1392 ms | — | 100.00% | — | 22 |
| 137 | z-ai/glm-5v-turbo limited data | zai | 1499 ms | 1835 ms | 1397 ms | — | 100.00% | — | 10 |
| 138 | deepseek/deepseek-v3.2-exp limited data | novita | 1504 ms | 1504 ms | 1502 ms | — | 100.00% | — | 1 |
| 139 | mistralai/mistral-nemo limited data | novita | 1527 ms | 1527 ms | 1524 ms | — | 100.00% | — | 1 |
| 140 | qwen/qwen3-next-80b-a3b-thinking limited data | novita | 1534 ms | 1812 ms | 1431 ms | — | 100.00% | — | 2 |
| 141 | openai/o4-mini limited data | openai | 1536 ms | 2083 ms | 1534 ms | — | 100.00% | — | 9 |
| 142 | deepseek/deepseek-v4-pro | deepseek | 1540 ms | 2316 ms | 1473 ms | — | 100.00% | — | 68 |
| 143 | deepseek/deepseek-v4-pro limited data | siliconflow | 1543 ms | 2147 ms | 1439 ms | — | 100.00% | — | 11 |
| 144 | deepseek-ai/DeepSeek-V4-Pro limited data | nebius | 1544 ms | 4096 ms | 1485 ms | — | 100.00% | — | 9 |
| 145 | openai/gpt-oss-20b limited data | phala | 1551 ms | 1580 ms | 1448 ms | — | 100.00% | — | 3 |
| 146 | z-ai/glm-5 limited data | siliconflow | 1554 ms | 2317 ms | 1472 ms | — | 100.00% | — | 15 |
| 147 | nvidia/nemotron-3-super-120b-a12b limited data | nebius | 1562 ms | 1825 ms | 1560 ms | — | 100.00% | — | 3 |
| 148 | qwen/qwen3-235b-a22b-thinking-2507 limited data | novita | 1574 ms | 1574 ms | 1471 ms | — | 100.00% | — | 1 |
| 149 | qwen/qwen3.5-397b-a17b limited data | novita | 1581 ms | 1597 ms | 1495 ms | — | 100.00% | — | 2 |
| 150 | qwen/qwen3-coder-next limited data | novita | 1586 ms | 2176 ms | 1584 ms | — | 100.00% | — | 2 |
| 151 | anthropic/claude-opus-4.5 limited data | anthropic | 1604 ms | 2710 ms | 1603 ms | — | 100.00% | — | 13 |
| 152 | deepseek/deepseek-v3-0324 limited data | novita | 1605 ms | 2336 ms | 1604 ms | — | 100.00% | — | 4 |
| 153 | minimax/minimax-m2.1 limited data | novita | 1629 ms | 1629 ms | 1525 ms | — | 100.00% | — | 1 |
| 154 | moonshotai/kimi-k2-thinking limited data | novita | 1635 ms | 1852 ms | 1531 ms | — | 100.00% | — | 2 |
| 155 | minimax/minimax-m2.1-highspeed | minimax | 1639 ms | 3343 ms | 1620 ms | — | 100.00% | — | 26 |
| 156 | qwen/qwen3.5-397b-a17b limited data | venice | 1641 ms | 8745 ms | 1614 ms | — | 100.00% | — | 16 |
| 157 | deepseek/deepseek-v4-flash limited data | siliconflow | 1653 ms | 14180 ms | 1549 ms | — | 100.00% | — | 17 |
| 158 | minimax/minimax-m2.7-highspeed limited data | minimax | 1662 ms | 8083 ms | 1633 ms | — | 100.00% | — | 19 |
| 159 | Qwen/Qwen3-235B-A22B-Thinking-2507-fast limited data | nebius | 1671 ms | 1827 ms | 1568 ms | — | 100.00% | — | 5 |
| 160 | qwen/qwen3.5-122b-a10b limited data | novita | 1685 ms | 1685 ms | 1582 ms | — | 100.00% | — | 1 |
| 161 | minimax/minimax-m2.5-highspeed | minimax | 1689 ms | 3988 ms | 1568 ms | — | 100.00% | — | 30 |
| 162 | moonshotai/kimi-k2.5 | kimi | 1691 ms | 3441 ms | 1632 ms | — | 95.45% | — | 66 |
| 163 | qwen/qwen3-vl-8b-instruct limited data | novita | 1706 ms | 1706 ms | 1602 ms | — | 100.00% | — | 1 |
| 164 | minimax/minimax-m2.5 limited data | phala | 1709 ms | 2231 ms | 1707 ms | — | 100.00% | — | 3 |
| 165 | anthropic/claude-sonnet-4 limited data | anthropic | 1717 ms | 2331 ms | 1668 ms | — | 100.00% | — | 17 |
| 166 | deepseek/deepseek-v4-flash limited data | novita | 1725 ms | 2559 ms | 1621 ms | — | 100.00% | — | 2 |
| 167 | z-ai/glm-5-turbo limited data | venice | 1733 ms | 8245 ms | 1629 ms | — | 100.00% | — | 8 |
| 168 | openai/gpt-oss-120b-fast limited data | nebius | 1734 ms | 1899 ms | 1631 ms | — | 100.00% | — | 3 |
| 169 | Qwen/Qwen3-Next-80B-A3B-Thinking-fast limited data | nebius | 1745 ms | 1856 ms | 1642 ms | — | 100.00% | — | 3 |
| 170 | MiniMaxAI/MiniMax-M2.5-fast limited data | nebius | 1760 ms | 1995 ms | 1757 ms | — | 100.00% | — | 2 |
| 171 | deepseek-ai/DeepSeek-V3.2-fast limited data | nebius | 1766 ms | 2497 ms | 1662 ms | — | 100.00% | — | 8 |
| 172 | qwen/qwen3-omni-30b-a3b-thinking limited data | novita | 1771 ms | 1986 ms | 1667 ms | — | 100.00% | — | 3 |
| 173 | minimax/minimax-m2.7 limited data | novita | 1778 ms | 1909 ms | 1675 ms | — | 100.00% | — | 2 |
| 174 | Qwen/Qwen3-Next-80B-A3B-Thinking limited data | nebius | 1784 ms | 2076 ms | 1774 ms | — | 100.00% | — | 9 |
| 175 | moonshotai/Kimi-K2.5 limited data | nebius | 1786 ms | 2623 ms | 1755 ms | — | 100.00% | 1 probe_config_error | 5 |
| 176 | deepseek/deepseek-v4-pro limited data | novita | 1791 ms | 1791 ms | 1687 ms | — | 100.00% | — | 1 |
| 177 | z-ai/glm-5.1 limited data | phala | 1798 ms | 4624 ms | 1797 ms | — | 100.00% | — | 4 |
| 178 | deepseek/deepseek-v3.1-terminus limited data | novita | 1803 ms | 2207 ms | 1802 ms | — | 100.00% | — | 2 |
| 179 | Qwen/Qwen3.5-397B-A17B-fast limited data | nebius | 1804 ms | 2034 ms | 1699 ms | — | 100.00% | — | 6 |
| 180 | moonshotai/kimi-k2-0905 limited data | novita | 1834 ms | 2375 ms | 1832 ms | — | 100.00% | — | 4 |
| 181 | z-ai/glm-5v-turbo limited data | siliconflow | 1882 ms | 3596 ms | 1780 ms | — | 84.21% | — | 19 |
| 182 | qwen/qwen3.5-27b limited data | novita | 1897 ms | 4113 ms | 1795 ms | — | 100.00% | — | 2 |
| 183 | anthropic/claude-sonnet-4.6 limited data | anthropic | 1925 ms | 3056 ms | 1822 ms | — | 100.00% | — | 12 |
| 184 | z-ai/glm-4.7 limited data | zai | 1950 ms | 2531 ms | 1948 ms | — | 100.00% | — | 11 |
| 185 | deepseek/deepseek-v3-turbo limited data | novita | 1953 ms | 1953 ms | 1849 ms | — | 100.00% | — | 1 |
| 186 | xiaomimimo/mimo-v2.5-pro limited data | novita | 1955 ms | 2860 ms | 1953 ms | — | 100.00% | — | 2 |
| 187 | anthropic/claude-sonnet-4.5 limited data | anthropic | 1969 ms | 3721 ms | 1885 ms | — | 100.00% | — | 15 |
| 188 | minimax/minimax-m2.5-highspeed limited data | novita | 1975 ms | 2132 ms | 1871 ms | — | 100.00% | — | 2 |
| 189 | moonshotai/kimi-k2.6 | kimi | 1988 ms | 2589 ms | 1897 ms | — | 90.00% | — | 50 |
| 190 | deepseek/deepseek-v3.1 limited data | novita | 2007 ms | 2369 ms | 2006 ms | — | 100.00% | — | 3 |
| 191 | openai/o3-mini limited data | openai | 2015 ms | 2467 ms | 2012 ms | — | 100.00% | — | 12 |
| 192 | baidu/ernie-4.5-vl-424b-a47b limited data | novita | 2036 ms | 2036 ms | 1933 ms | — | 100.00% | — | 1 |
| 193 | z-ai/glm-4.6 limited data | zai | 2074 ms | 2740 ms | 2044 ms | — | 92.86% | — | 14 |
| 194 | moonshotai/kimi-k2.6 limited data | novita | 2097 ms | 2097 ms | 1993 ms | — | 100.00% | — | 1 |
| 195 | zai-org/GLM-5 limited data | nebius | 2100 ms | 2581 ms | 1997 ms | — | 100.00% | — | 5 |
| 196 | tencent/hunyuan-a13b-instruct limited data | siliconflow | 2134 ms | 3382 ms | 2031 ms | — | 100.00% | — | 17 |
| 197 | minimax/minimax-m3 | siliconflow | 2135 ms | 5025 ms | 2134 ms | — | 61.90% | — | 21 |
| 198 | deepseek/deepseek-r1-0528 limited data | novita | 2138 ms | 2138 ms | 2136 ms | — | 100.00% | — | 1 |
| 199 | anthropic/claude-opus-4 limited data | anthropic | 2161 ms | 4938 ms | 2058 ms | — | 100.00% | — | 14 |
| 200 | zai-org/glm-4.7 limited data | novita | 2180 ms | 2180 ms | 2178 ms | — | 100.00% | — | 1 |
| 201 | moonshotai/kimi-k2-instruct limited data | novita | 2187 ms | 2187 ms | 2084 ms | — | 100.00% | — | 1 |
| 202 | zai-org/glm-4.6v limited data | novita | 2209 ms | 2209 ms | 2107 ms | — | 100.00% | — | 1 |
| 203 | deepseek/deepseek-ocr limited data | novita | 2227 ms | 2227 ms | 2123 ms | — | 100.00% | — | 1 |
| 204 | anthropic/claude-opus-4.6 limited data | anthropic | 2255 ms | 2747 ms | 2152 ms | — | 100.00% | — | 12 |
| 205 | google/gemini-3.1-pro-preview limited data | gemini | 2255 ms | 3989 ms | 2254 ms | — | 100.00% | 12 probe_config_error | 11 |
| 206 | moonshotai/kimi-k2.5 limited data | phala | 2264 ms | 4817 ms | 2160 ms | — | 100.00% | — | 8 |
| 207 | tencent/hy3-preview | siliconflow | 2274 ms | 3280 ms | 2171 ms | — | 100.00% | — | 22 |
| 208 | deepseek/deepseek-r1-turbo limited data | novita | 2284 ms | 2400 ms | 2180 ms | — | 100.00% | — | 3 |
| 209 | deepseek/deepseek-v3.2 limited data | novita | 2301 ms | 2811 ms | 2199 ms | — | 100.00% | — | 3 |
| 210 | z-ai/glm-4.7 limited data | phala | 2311 ms | 2900 ms | 2288 ms | — | 100.00% | — | 7 |
| 211 | z-ai/glm-5 limited data | phala | 2322 ms | 16618 ms | 2217 ms | — | 66.67% | — | 6 |
| 212 | xiaomi/mimo-v2.5-pro | xiaomi | 2360 ms | 4561 ms | 2257 ms | — | 100.00% | — | 31 |
| 213 | moonshotai/kimi-k2.6 limited data | parasail | 2361 ms | 7775 ms | 2257 ms | — | 66.67% | — | 6 |
| 214 | minimax/minimax-m3 | minimax | 2363 ms | 4595 ms | 2350 ms | — | 100.00% | — | 22 |
| 215 | zai-org/glm-4.6 limited data | novita | 2374 ms | 2759 ms | 2371 ms | — | 100.00% | — | 4 |
| 216 | z-ai/glm-5v-turbo limited data | venice | 2385 ms | 6973 ms | 2383 ms | — | 100.00% | — | 10 |
| 217 | anthropic/claude-opus-4.1 limited data | anthropic | 2407 ms | 3439 ms | 2304 ms | — | 100.00% | — | 9 |
| 218 | qwen/qwen3.5-397b-a17b limited data | phala | 2443 ms | 3305 ms | 2340 ms | — | 100.00% | — | 9 |
| 219 | xiaomi/mimo-v2-pro | xiaomi | 2472 ms | 3635 ms | 2368 ms | — | 100.00% | — | 20 |
| 220 | deepseek/deepseek-v3.2 limited data | phala | 2530 ms | 3700 ms | 2529 ms | — | 100.00% | — | 4 |
| 221 | xiaomi/mimo-v2.5 | xiaomi | 2706 ms | 4143 ms | 2696 ms | — | 100.00% | — | 27 |
| 222 | z-ai/glm-5 limited data | zai | 2729 ms | 3942 ms | 2627 ms | — | 100.00% | — | 9 |
| 223 | z-ai/glm-5.1 limited data | zai | 2781 ms | 4838 ms | 2679 ms | — | 100.00% | — | 6 |
| 224 | zai-org/glm-5 limited data | novita | 2952 ms | 2952 ms | 2849 ms | — | 100.00% | — | 1 |
| 225 | deepseek/deepseek-v4-pro | gmi | 2981 ms | 5094 ms | 2958 ms | — | 96.67% | — | 30 |
| 226 | google/gemma-4-31b-it | gmi | 3126 ms | 11709 ms | 3072 ms | — | 69.57% | — | 23 |
| 227 | openai/o1 limited data | openai | 3194 ms | 4254 ms | 3092 ms | — | 100.00% | — | 12 |
| 228 | moonshotai/kimi-k2.5 limited data | parasail | 3422 ms | 7893 ms | 3421 ms | — | 100.00% | — | 5 |
| 229 | z-ai/glm-5 | gmi | 3688 ms | 9982 ms | 3585 ms | — | 84.62% | — | 26 |
| 230 | z-ai/glm-5.1 | tinfoil | 3705 ms | 15106 ms | 3680 ms | — | 37.14% | — | 35 |
| 231 | zai-org/GLM-5.1 limited data | nebius | 3899 ms | 11386 ms | 3796 ms | — | 100.00% | — | 6 |
| 232 | qwen/qwen3-max limited data | novita | 4759 ms | 4759 ms | 4657 ms | — | 100.00% | — | 1 |
| 233 | z-ai/glm-5.1 | gmi | 5402 ms | 13482 ms | 5298 ms | — | 96.55% | — | 29 |
| 234 | z-ai/glm-4.7 limited data | parasail | 6293 ms | 7384 ms | 6190 ms | — | 75.00% | — | 4 |
| 235 | zai-org/glm-4.7-flash limited data | novita | 9738 ms | 9738 ms | 9737 ms | — | 100.00% | — | 1 |
| 236 | baidu/ernie-4.5-vl-28b-a3b limited data | novita | — | — | — | — | 0.00% | — | 2 |
| 237 | stepfun/step-3.5-flash limited data | parasail | — | — | — | — | 0.00% | — | 6 |
| 238 | google/gemma-3-12b-it limited data | novita | — | — | — | — | 0.00% | — | 3 |
| 239 | elephant limited data | novita | — | — | — | — | 0.00% | — | 3 |
| 240 | deepseek/deepseek-v4-pro limited data | parasail | — | — | — | — | 0.00% | 3 probe_config_error | 2 |
| 241 | kwaipilot/kat-coder-pro limited data | novita | — | — | — | — | 0.00% | — | 1 |
| 242 | sao10k/l31-70b-euryale-v2.2 limited data | novita | — | — | — | — | 0.00% | — | 1 |
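The p50 and p95 columns are percentiles over each row's samples, which is why rows marked "limited data" deserve caution. The page does not say which percentile method it uses. The sketch below assumes the nearest-rank method purely for illustration, and `percentile_nearest_rank` is a hypothetical helper name.

```python
import math


def percentile_nearest_rank(values, pct):
    """Return the pct-th percentile of values using the nearest-rank method."""
    if not values:
        raise ValueError("need at least one sample")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in (0, 100]")
    ordered = sorted(values)
    # 1-based rank of the smallest value that covers pct percent of the samples.
    rank = math.ceil(pct / 100.0 * len(ordered))
    return ordered[rank - 1]


# With a single sample, every percentile is that sample.
single = [728.0]
print(percentile_nearest_rank(single, 50), percentile_nearest_rank(single, 95))

# With two hypothetical samples, p50 is the lower value and p95 the higher one.
pair = [1303.0, 695.0]
print(percentile_nearest_rank(pair, 50), percentile_nearest_rank(pair, 95))
```

Whatever method is actually used, a row with one sample necessarily reports the same value for p50 and p95, as the single-sample rows in the table do. With only a handful of samples, p95 is effectively the slowest request observed, so one outlier can dominate it.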