Command palette

Search pages and run quick actions

AI Usage Analytics

Cost, latency and request volume across every model and endpoint.

Total Requests

0

+12.8%vs last month

Avg Latency

0ms

-6.2%vs last month

Error Rate

0.00%

-0.1%vs last month

Throughput

0 tok/s

+3.4%vs last month

Cost breakdown by model

Daily spend across 5 models

Request volume & latency

Requests per day with average latency overlay

Usage share by model
  • GPT-4o34%
  • Claude 3.5 Sonnet28%
  • GPT-4 Turbo12%
  • Gemini 1.5 Pro14%
  • Claude 3 Opus7%
  • Llama 3.1 70B5%
Top endpoints

Highest-volume API endpoints this period

EndpointRequestsAvg latencyError rate
/v1/chat/completions842,109510ms0.4%
/v1/embeddings318,402120ms0.1%
/v1/agents/run184,938690ms1.2%
/v1/completions96,204480ms0.6%
/v1/moderations71,88290ms0.0%
/v1/files42,119240ms0.3%