7-DAY FREE TRIALNO CREDIT CARD

Cut your AI costs by 30-50%.
Without changing your code.

RelayPlane routes API calls to the cheapest model that works. One line to install. Falls back automatically if routing fails.

Save 30% with free tier
Save 50%+ with Pro
$npm install -g @relayplane/proxy && relayplane-proxy
We never see your prompts
<5ms latency overhead
Auto-fallback on failure

How much could you save?

Enter your monthly API spend

$/month
$0$500$1,000+
Free tier (static rules)~$60/mo saved
~30% savings
Pro tier (Swarm routing)~$100/mo saved
~50% savings
Pro cost-$29/mo
Net savings with Pro+$71/mo

Pro pays for itself and saves you $11 more than Free

7-day Pro trial, then Free tier or $29/mo

“I was spending $200/month on Claude API calls for my coding agent. After installing RelayPlane, it dropped to $87. Same quality output—it just stopped using Opus for simple tasks like file reads and status checks.”

Matt Turley

Matt Turley

Founder, RelayPlane · 20 years building software

Yes, this is the founder's own story. Early user testimonials coming soon.

// THE SWARM

How intelligent routing actually works

The Swarm explained in 3 sentences

1

Analyze metadata

Looks at task type, token count, and complexity signals — never your actual prompts.

2

Check patterns

What worked for similar requests across all users? “Code review under 3K tokens succeeds 94% on Haiku.”

3

Pick optimal model

Chooses the cheapest model that meets your quality threshold — if you need 95% success, it won't risk 87%.

Network effect

Every user (free or paid) contributes anonymized success/fail data. More users = smarter routing for everyone. We're early—currently 50+ beta users contributing routing data.

Your
Agent
RelayPlane
Proxy
Swarm
API
// Request metadata
task: "code_review"
tokens: 2K
lang: "Python"
Swarm Decision
✓ 94% success on Haiku for this pattern
Sonnet is 96% but costs 8x more
→ Route to Haiku
Haiku
Response
Your Agent
If Haiku fails →automatic retry on Sonnet→ no manual intervention
// RELIABILITY

What if routing picks the wrong model?

This is the fear everyone has. Here's exactly what happens.

Scenario: Haiku returns garbage

// Automatic recovery:
1. Detect poor quality output
2. Retry immediately with Sonnet
3. Record failure for future routing
4. Your agent gets a valid response ✓

Worst case: You paid for both calls. Still cheaper than always using Opus.

Built-in fallback chains

// Default chain:
HaikuSonnetOpus
// Or configure your own:
GPT-4o-miniGPT-4oClaude

Circuit breakers: If a model fails 5x in a row, we skip it automatically for 30 seconds.

The math still works: Even with occasional fallback calls, users save 30-50% because most simple tasks do succeed on cheaper models. You're not paying Opus prices for file reads.

// SETUP

Start saving in 60 seconds

Three steps. Zero code changes.

1

Install the proxy

npm install -g @relayplane/proxy

MIT licensed. Runs 100% locally.

2

Point your agent at it

export ANTHROPIC_BASE_URL=localhost:3001

One environment variable. No code changes.

3

Watch your costs drop

relayplane stats

Real-time dashboard shows exactly what you're saving.

// ADVANCED CAPABILITIES

More than just a proxy

RelayPlane is a complete agent operations platform. Here's what's built and ready.

FEATURED

Policy Engine

Complete control over which models can be used, when, and by whom. Set allowlists, blocklists, cost thresholds, and approval requirements. “Never use GPT-4 for customer data” — enforced automatically.

Included in Max tier

Compliance & Audit

Decision Audit Trail

Know exactly why each call was routed where it was. Full explainability for compliance.

Approval Gates

Require human approval for expensive operations. Multi-approver workflows.

Operations & Cost Control

Budget Controls

Hard limits by day/week/month. Get alerts before you blow through spend.

Pattern Detection

Spots anomalies, cost spikes, and failure patterns. Suggests optimizations.

Team Collaboration

Teams & Workspaces

Multi-user access with role-based permissions. Shared learnings across team agents. Separate budgets per team.

Full dashboard UI included (53 components). This isn't vaporware.

See documentation →
// PRICING

Free forever. Pro when you're ready.

7-day free trial on Pro — no credit card required.

Free

$0 forever

For side projects and experimentation.

  • Local proxy
  • Static routing rules
  • Local learning
  • 30% average savings
  • MIT licensed
  • Offline mode
POPULAR

Pro

$29/mo

For indie hackers running agents daily.

  • Everything in Free
  • Swarm API (50%+ savings)
  • Network intelligence
  • Cloud dashboard
  • Anomaly alerts
  • 30-day history

Max

$99/mo

Everything included — no limits, full enterprise features.

  • Everything in Pro
  • Unlimited team seats
  • SSO/SAML authentication
  • Audit logs (exportable)
  • Self-hosted option
  • SLA guarantee
  • Dedicated support
Feature
Free
Pro
Max
Swarm API routing
Cloud dashboard
Team seats
1
1
Unlimited
SSO/SAML
Policy engine
History retention
Local
30 days
Custom
Core Routing
Local proxy routing
Static routing rules
Swarm API (intelligent routing)
Network learning pool
Auto-fallback chains
Custom model ordering
Dashboard
Local savings dashboard
Cloud dashboard
Decision explainability
Cost anomaly alerts
Data export (CSV/JSON)
History retention
Local
30 days
Custom
Governance
Policy engine
Budget controls
Approval gates
Model allowlists/blocklists
Audit logs (exportable, signed)
Team & Security
Team seats
1
1
Unlimited
Shared learnings
SSO/SAML authentication
Self-hosted deployment
API access
Support
Community support
Email support
Dedicated support
SLA guarantee

Break-even math: Pro pays for itself at $60/mo API spend. If you're spending $100+/mo, you'll save $20-50/mo after the subscription.

// PRIVACY-FIRST ARCHITECTURE

We never see your prompts.

Your data stays on your machine. We only see anonymized metadata for routing decisions.

What we collect

Anonymized metadata only

device_id"anon_8f3a..."
task_type"code_review"
model"claude-3-5-haiku"
tokens1847 / 423
successtrue

What we NEVER collect

Zero access to your content

Your prompts or instructions
Model responses or outputs
File paths, code, or contents
Anything that identifies you

Verify it yourself

See exactly what gets sent

relayplane-proxy --audit
// FAQ

Common questions

Technical

What if the cheap model returns garbage?

RelayPlane automatically retries with a better model. You pay for both calls, but that's still cheaper than always using Opus. The Swarm learns from failures so routing improves over time.

How much latency does this add?

Local proxy: <5ms. Swarm API call: 50-100ms. For a 2-5 second LLM call, this is negligible. If latency matters more than cost, configure more aggressive model selection.

What if the Swarm API goes down?

Proxy falls back to static rules automatically. Your agent keeps working. This is baked into the architecture—we assume network calls can fail.

Trust & Business

Can I audit exactly what's being sent?

Yes. Run `relayplane-proxy --audit` to see every payload before it's sent. Or read the MIT-licensed source code on GitHub.

What if you shut down?

The local proxy works forever—it's MIT licensed software on your machine. Only the Swarm API would stop. You'd keep ~30% savings on static rules.

Will this break my agent?

No. If RelayPlane can't route, it passes through to your default model. Worst case: you pay what you would have paid anyway.

How do I know I'm actually saving money?

Run `relayplane stats` or check the cloud dashboard. We show you exactly how much you've saved vs. what you would have spent.

// WHO MADE THIS

Built by a real person.

Matt Turley

Matt Turley

Solo Founder · 20 years in software

I built RelayPlane because I was spending $200/mo on Claude API calls and realized most of them didn't need Opus. Now I spend $87.

This is a real product from a real person, not a VC-funded growth hack. I've been independent for 16 years and I'll be around to fix bugs and answer questions.

Start saving today.

7-day free trial. No credit card required. Downgrade to free tier anytime.

$npm install -g @relayplane/proxy && relayplane-proxy
OPEN SOURCEMIT LICENSELOCAL-FIRSTAUTO-FALLBACK