aiprompt.fyi
Student

The LLM Leaderboard, ranked by V-Index.

An independent ranking of the major 2026 frontier models. The V-Index divides curated quality (1–10) by USD per 1M input tokens — higher is better Bang for the Buck.

Best V-Index
DeepSeek V3.2
V-Index 28.93 · $0.28/1M
Highest Quality
Claude 4.6
Quality 9.4 · $3/1M
Cheapest
DeepSeek V3.2
$0.28/1M · Quality 8.1
Sort by
#ModelVendorQuality /10$/1M tokV-Index
01
DeepSeek V3.2
The Switch & Save disruptor
DeepSeek8.1$0.2828.93
02
Qwen 3 Max
Alibaba8.2$0.4020.50
03
Llama 4 Maverick
Open-weights workhorse
Meta8.3$0.5016.60
04
Gemini 3 Ultra
Google8.9$1.505.93
05
Mistral Large 3
EU-hosted
Mistral8.5$1.804.72
06
Grok 4.20
xAI8.4$2.004.20
07
GPT-5
OpenAI9.2$2.503.68
08
Claude 4.6
Anthropic9.4$3.003.13

Precision by task

Pick a task type — each dot reflects how reliably the model handles that workload in our audit corpus. Green = High, amber = Medium, red = Low.

ModelPrecision · ReasoningV-Index
DeepSeek V3.2High28.93
Qwen 3 MaxHigh20.50
Llama 4 MaverickHigh16.60
Gemini 3 UltraHigh5.93
Mistral Large 3High4.72
Grok 4.20High4.20
GPT-5High3.68
Claude 4.6High3.13
Methodology

How the V-Index is calculated

The V-Index is a single number — quality divided by USD per 1M input tokens — designed to capture Bang for the Buck rather than raw quality. Curated quality scores (1–10) are reviewed quarterly across reasoning, coding and factual benchmarks; pricing tracks each vendor's published list price for input tokens.

V-Index = quality / price-per-1M-tokens. A model at quality 8.0 priced at $0.40/1M scores 20.0. A model at quality 9.4 priced at $3.00/1M scores 3.13. Both can be the right answer — depending on the workload.

Updated quarterly. Last review: Q2 2026. Pricing reflects published list prices in USD per 1M input tokens. Enterprise discounts, cached-token pricing and output-token premiums are not included.

Don't pick a model. Audit your prompt.

The best model for your prompt depends on the prompt itself. Paste yours and we'll score it against every model on this board.

Run a free audit