Agent Cost Calculator

Estimate the true cost of agentic workflows: multi-step LLM calls, tool execution, retries, task failures, and human review โ€” per task and at production volume.

Workflow Configuration

LLM Model

Fast & cheap; good for structured agent steps ยท $0.4/1M in ยท $1.6/1M out

Workflow Steps (LLM Calls per Task)

Total LLM calls/task: 6

Token Profile per Call

Error & Review Rates

0%50%
0%30%
0%100%

Production Volume

Configure your workflow and click Calculate

Cost breakdown by step will appear here

Agent Cost Optimization Tips

  • Use smaller models for tool-parsing steps. Route planning & synthesis to GPT-4.1 or Claude Sonnet, but use Haiku/Flash/nano for JSON extraction and validation steps โ€” 5โ€“20ร— cheaper.
  • Reduce tool schema tokens. Trimming tool descriptions from 600 to 200 tokens cuts input costs on every tool-call LLM call. Use concise tool definitions.
  • Cache repeated context. System prompts, tool schemas, and static RAG content repeated across calls are candidates for prompt caching (Anthropic, OpenAI both support this).
  • Track retry rate closely. A 20%+ retry rate usually signals prompt engineering issues โ€” structured outputs (JSON mode, constrained decoding) often eliminate most retries.
  • Failure waste compounds fast. A 10% task failure rate on 1000 tasks/day means 100 tasks worth of LLM cost with zero business value. Add guardrails and early termination.