Model Comparison

Compare cost, speed, and context across every provider, then pick your primary model.

Model              Provider   Input / 1K tokens  Output / 1K tokens  Context      Speed      Latency   Requests (mo)
GPT-4o (Primary)   OpenAI     $0.005             $0.015              128K tokens  109 tok/s  480 ms    421,882
Claude 3.5 Sonnet  Anthropic  $0.003             $0.015              200K tokens  87 tok/s   540 ms    347,405
GPT-4 Turbo        OpenAI     $0.01              $0.03               128K tokens  61 tok/s   720 ms    148,902
Gemini 1.5 Pro     Google     $0.00125           $0.005              1M tokens    94 tok/s   610 ms    173,544
Claude 3 Opus      Anthropic  $0.015             $0.075              200K tokens  41 tok/s   1180 ms   86,771

Llama 3.1 70B
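
To make the cost comparison concrete, here is a minimal sketch that estimates the price of a single request for each model listed above. It assumes the "Input / 1K" and "Output / 1K" figures are USD per 1,000 tokens. The function names `request_cost` and `rank_by_cost` and the sample token counts are hypothetical, chosen only for illustration. Llama 3.1 70B is left out because no prices are shown for it.

```python
# Prices in USD, copied from the comparison above as (input, output).
# Assumption: each figure is the price per 1,000 tokens.
PRICES = {
    "GPT-4o": (0.005, 0.015),
    "Claude 3.5 Sonnet": (0.003, 0.015),
    "GPT-4 Turbo": (0.01, 0.03),
    "Gemini 1.5 Pro": (0.00125, 0.005),
    "Claude 3 Opus": (0.015, 0.075),
}


def request_cost(model, input_tokens, output_tokens):
    """Estimate the USD cost of one request for the given model."""
    input_price, output_price = PRICES[model]
    return (input_tokens / 1000) * input_price + (output_tokens / 1000) * output_price


def rank_by_cost(input_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {m: request_cost(m, input_tokens, output_tokens) for m in PRICES}
    return sorted(costs.items(), key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: 2,000 input tokens and 500 output tokens.
    for model, cost in rank_by_cost(2000, 500):
        print(f"{model}: ${cost:.4f}")
```

Cost is only one axis. A model that ranks cheapest here may still be the wrong primary if its latency or context window does not fit the workload.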
