OpenAI compatible API. Attested gateway. Public status.

Meta: Llama 3.3 70B Instruct Benchmarks

Benchmark and measurement links for Meta: Llama 3.3 70B Instruct, with TrustedRouter route data first.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

meta-llama/llama-3.3-70b-instruct

Benchmarks

All models

Published benchmark scores

First-party model-card and open-leaderboard scores for Meta: Llama 3.3 70B Instruct — every row links to its primary source. TrustedRouter does not run these evals; we cite them, and only attach a score to the exact checkpoint it was measured on.

BenchmarkCategoryScoreSource
HumanEval Coding 88.4% Meta — Llama 3.3 70B model card
2024-12-06
IFEval Instruction following 92.1% Meta — Llama 3.3 70B model card
2024-12-06
MMLU
0-shot, CoT
Knowledge 86.0% Meta — Llama 3.3 70B model card
2024-12-06
MMLU-Pro
CoT
Knowledge 68.9% Meta — Llama 3.3 70B model card
2024-12-06
MATH
0-shot, CoT
Math 77.0% Meta — Llama 3.3 70B model card
2024-12-06
GPQA Diamond
0-shot, CoT
Science 50.5% Meta — Llama 3.3 70B model card
2024-12-06

TrustedRouter measurements

TrustedRouter publishes route and status measurements without storing prompt or output content. Provider latency and uptime are exposed through the model performance and uptime pages.

External benchmark references

Sign in

Choose a sign in method.