The customer
A Series A B2B SaaS burning roughly $48K/month in LLM API spend across two AI features (a chat copilot and a summarisation pipeline). Both shipped fast in 2024 and were never tuned afterwards.
The task they submitted
“Audit our LLM usage end-to-end. Find spend we can cut without losing quality. Build the eval suite that proves it.”
Our approach
Day 1: Traffic profiling. 80% of spend was concentrated on a single endpoint that didn't need GPT-4-class reasoning.
Day 2: Built a 420-question eval suite from 90 days of real customer logs, with automated grading.
Day 3: Switched the bulk path to a smaller open model behind a vLLM gateway, tuned the prompts, and added prompt caching to the remaining frontier-model calls.
Day 4: Rolled out behind a 10% canary, validated against the eval suite, then went to 100%.
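The Day 4 canary can be sketched with a few lines of routing logic. This is an illustrative sketch, not the customer's actual gateway code: `canary_bucket` and the model names are hypothetical, and the key idea is that hashing a stable request or user ID gives each request a deterministic bucket, so the same caller stays on the same model for the whole rollout and raising the percentage never reshuffles earlier assignments.

```python
import hashlib

def canary_bucket(request_id: str, rollout_pct: int) -> str:
    """Deterministically route a request to the canary or control model.

    Hashing the request ID yields a stable bucket in 0-99, so a given
    request always lands on the same path at a given rollout percentage.
    Model names here are placeholders, not the customer's real config.
    """
    digest = hashlib.sha256(request_id.encode()).digest()
    bucket = int.from_bytes(digest[:2], "big") % 100
    return "small-model" if bucket < rollout_pct else "frontier-model"

# At a 10% rollout, roughly one request in ten hits the smaller model;
# moving rollout_pct to 100 shifts all traffic without re-bucketing.
routes = [canary_bucket(f"req-{i}", 10) for i in range(1000)]
share = routes.count("small-model") / len(routes)
```

Gating the percentage increase on eval-suite results, as in the rollout above, is what turns a cost cut into a cost cut with a quality guarantee.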
The outcome
$48K → $19K monthly run-rate. Zero regressions on the eval suite. The customer used the savings to fund three more Boundev tasks over the following quarter.
“$48K to $19K a month, no quality regression. Paid for two years of Growth in the first month.”
