Model Comparison

Compare cost, speed, and context window across every provider, then pick your primary model.

Primary

GPT-4o

OpenAI

Input / 1K

$0.005

Output / 1K

$0.015

Context: 128K tokens
Speed: 109 tok/s
Latency: 480 ms
Requests (mo): 421,882

Claude 3.5 Sonnet

Anthropic

Input / 1K

$0.003

Output / 1K

$0.015

Context: 200K tokens
Speed: 87 tok/s
Latency: 540 ms
Requests (mo): 347,405

GPT-4 Turbo

OpenAI

Input / 1K

$0.01

Output / 1K

$0.03

Context: 128K tokens
Speed: 61 tok/s
Latency: 720 ms
Requests (mo): 148,902

Gemini 1.5 Pro

Google

Input / 1K

$0.00125

Output / 1K

$0.005

Context: 1M tokens
Speed: 94 tok/s
Latency: 610 ms
Requests (mo): 173,544

Claude 3 Opus

Anthropic

Input / 1K

$0.015

Output / 1K

$0.075

Context: 200K tokens
Speed: 41 tok/s
Latency: 1180 ms
Requests (mo): 86,771

Llama 3.1 70B

Meta

Input / 1K

$0.00088

Output / 1K

$0.00088

Context: 128K tokens
Speed: 132 tok/s
Latency: 390 ms
Requests (mo): 61,984
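
The per-1K prices listed above can be turned into a rough per-request estimate. The following is a minimal sketch, not part of the dashboard itself: the helper names `request_cost` and `rank_by_cost` are hypothetical, and the example workload (2,000 input tokens, 500 output tokens) is an assumption chosen purely for illustration. The prices are copied from the cards above.

```python
# Prices in USD per 1K tokens as (input, output), copied from the cards above.
PRICES = {
    "GPT-4o": (0.005, 0.015),
    "Claude 3.5 Sonnet": (0.003, 0.015),
    "GPT-4 Turbo": (0.01, 0.03),
    "Gemini 1.5 Pro": (0.00125, 0.005),
    "Claude 3 Opus": (0.015, 0.075),
    "Llama 3.1 70B": (0.00088, 0.00088),
}


def request_cost(model, input_tokens, output_tokens):
    """Estimated USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES[model]
    return (input_tokens / 1000) * input_price + (output_tokens / 1000) * output_price


def rank_by_cost(input_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {m: request_cost(m, input_tokens, output_tokens) for m in PRICES}
    return sorted(costs.items(), key=lambda pair: pair[1])


if __name__ == "__main__":
    # Assumed workload: 2,000 input tokens and 500 output tokens per request.
    for model, cost in rank_by_cost(2000, 500):
        print(f"{model:<20} ${cost:.5f}")
```

Because input and output are priced separately, the ranking can shift with the input-to-output ratio, so it is worth rerunning the estimate with token counts that match the actual workload.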
Compare models

Output cost ($ / 1K tokens)