Agent Cost Calculator
Estimate the true cost of agentic workflows: multi-step LLM calls, tool execution, retries, task failures, and human review โ per task and at production volume.
Workflow Configuration
LLM Model
Fast & cheap; good for structured agent steps ยท $0.4/1M in ยท $1.6/1M out
Workflow Steps (LLM Calls per Task)
Total LLM calls/task: 6
Token Profile per Call
Error & Review Rates
0%50%
0%30%
0%100%
Production Volume
Configure your workflow and click Calculate
Cost breakdown by step will appear here
Agent Cost Optimization Tips
- Use smaller models for tool-parsing steps. Route planning & synthesis to GPT-4.1 or Claude Sonnet, but use Haiku/Flash/nano for JSON extraction and validation steps โ 5โ20ร cheaper.
- Reduce tool schema tokens. Trimming tool descriptions from 600 to 200 tokens cuts input costs on every tool-call LLM call. Use concise tool definitions.
- Cache repeated context. System prompts, tool schemas, and static RAG content repeated across calls are candidates for prompt caching (Anthropic, OpenAI both support this).
- Track retry rate closely. A 20%+ retry rate usually signals prompt engineering issues โ structured outputs (JSON mode, constrained decoding) often eliminate most retries.
- Failure waste compounds fast. A 10% task failure rate on 1000 tasks/day means 100 tasks worth of LLM cost with zero business value. Add guardrails and early termination.