Model Comparison
Compare cost, speed and context across every provider — and pick your primary model.
Primary
Op
GPT-4o
OpenAI
Input / 1K
$0.005
Output / 1K
$0.015
Context128K tokens
Speed109 tok/s
Latency480ms
Requests (mo)421,882
An
Claude 3.5 Sonnet
Anthropic
Input / 1K
$0.003
Output / 1K
$0.015
Context200K tokens
Speed87 tok/s
Latency540ms
Requests (mo)347,405
Op
GPT-4 Turbo
OpenAI
Input / 1K
$0.01
Output / 1K
$0.03
Context128K tokens
Speed61 tok/s
Latency720ms
Requests (mo)148,902
GoGoogle
Gemini 1.5 Pro
Input / 1K
$0.00125
Output / 1K
$0.005
Context1M tokens
Speed94 tok/s
Latency610ms
Requests (mo)173,544
An
Claude 3 Opus
Anthropic
Input / 1K
$0.015
Output / 1K
$0.075
Context200K tokens
Speed41 tok/s
Latency1180ms
Requests (mo)86,771
Me
Llama 3.1 70B
Meta
Input / 1K
$0.00088
Output / 1K
$0.00088
Context128K tokens
Speed132 tok/s
Latency390ms
Requests (mo)61,984
Compare models
Output cost ($ / 1K tokens)