This is a demo report. Numbers come from PitCrew’s real engine on a hand-curated agent description.
Audit your own agent →
Trying google Gemini 2.5 Pro as the default build. The headline cost reflects this swap; the optimization plan + cascade still come from the original audit on openai GPT-4o.
← Back to original
Audit · May 3, 2026Default: google · Gemini 2.5 Pro

Your forecast is in

Single-turn RAG agent that retrieves chunks from our internal engineering docs (architecture decisions, runbooks, API references) and answers questions. ~120 engineers ask 4-5 questions a day.

Alternative models

Same quality tier, your wizard inputs. No caching or batch applied — every row is a directly-comparable raw monthly cost. Click Try as default to re-render this report with that model as the new baseline.

ModelInput $/MtokOutput $/MtokContextMonthly costvs defaultOpen in audit
deepseekV4
codingreasoning
$0.30$0.50$2/mo$-33/moTry as default →
deepseekR1
complex reasoning
$0.55$2$8/mo$-28/moTry as default →
mistralLarge 2
multilingualreasoning
$2$6$22/mo$-13/moTry as default →
googleGemini 2.5 Pro
long contextmultimodal
$1$10$33/mo$-3/moTry as default →
openaiGPT-4o
Default
multimodal
$3$10$36/moTry as default →
openaiGPT-5.2
balanced
$2$14$46/mo+$10/moTry as default →
anthropicSonnet 4.6
general purposebalanced
$3$15$52/mo+$16/moTry as default →

Run another audit
for a different build

Tweak inputs, swap the model, see how the forecast moves.

New audit