COST INTELLIGENCE FOR OPENCLAW

Your agents are burning money.
Now you can see why.

RelayPlane sits between your OpenClaw agents and the LLM providers they call. It tracks every dollar, shows you exactly where it goes, and gives you the tools to cut waste.

Install FreeOpen Source on GitHub

OPEN SOURCE · MIT LICENSE · FREE FOREVER

🔒 relayplane.com/dashboard
Runs
Policies
Routing
Analytics
Proxy Active
Runs
Monitor and analyze agent runs in real-time
Live↻ Refresh
Total Runs
0
↗ +14.5% vs last week
Total Cost
$0
↗ +11.2% vs last week
Avg Latency
0ms
↘ -12.8% vs last week
Success Rate
0%
↗ +1.5% vs last week
#a3f7ccoder-mainclaude-opus · 5m ago$0.471.2s
#b8e2dcoder-mainclaude-sonnet · 12m ago$0.08623ms
#c4f9aci-agentgpt-4o · 25m ago$0.0430.0s
#d7e1bresearch-agentclaude-opus · 45m ago$0.000ms
#e2c8fcoder-mainclaude-opus · 30s ago$0.240ms
Analytics
Cost breakdown and savings analysis

Savings This Week

$168.40
saved vs. unoptimized routing

Cost by Model

Opus $56
Sonnet $41
Haiku $32
GPT-4o $18

Cost per Model (7 days)

claude-opus$56.20
claude-sonnet$41.10
claude-haiku$31.80
gpt-4o$18.13
Routing
Live routing decisions
Read file: src/utils/config.ts
SimpleHaikusaved $0.14
2s ago
Generate unit tests for auth module
MediumSonnetsaved $0.08
18s ago
Refactor database schema for multi-tenancy
ComplexOpuscorrect tier
34s ago
Format JSON output from API response
SimpleHaikusaved $0.12
1m ago
Implement WebSocket reconnection logic
MediumSonnetsaved $0.11
2m ago
Policies
Active routing rules
if task = SIMPLE then route → haikuFile reads, small edits, simple questions
Active · matched 847 times today
if task = MEDIUM then route → sonnetCode generation, refactoring, test writing
Active · matched 312 times today
if context > 50K tokens then route → gemini-proLarge context window — cheaper per token
Active · matched 88 times today
if monthly_spend > $200 then alert via telegramBudget guardrail — notify before overspend
Active · budget: $147.23 / $200
THE PROBLEM

Anthropic banned subscription tokens.
Your bill just 6x'd.

On February 18, Anthropic banned using Max plan tokens in OpenClaw and similar tools. Users who were paying $200/month now face $600–2,000+ in API costs for the same usage.

But the cost crisis started before the ban. The real problem: your agent uses the most expensive model for everything. Code formatting? Opus. Reading a file? Opus. Simple questions? Opus.

73% of typical agent spend is on the wrong model for the task.
Before RelayPlane
Simple tasks (file reads, edits)Opus$142/moWaste
Medium tasks (code generation)Opus$38/moOverspend
Complex tasks (architecture)Opus$18/moCorrect
Retried failuresOpus$24/moPreventable
Without RelayPlane
$222/mo
With RelayPlane
$52/mo
You save
$170/mo
77% less
How it works
Simple tasksOpus → Haiku$142 → $12
Medium tasksOpus → Sonnet$38 → $22
Complex tasksOpus → Opus$18 → $18No change
Retried failuresEliminated$24 → $0Avoided

How much could you save?

Enter your monthly API spend

$/month
Free tier (smart routing)
−$150
Contributor tier (mesh routing — coming soon)
−$300
Net savings: +$300/mo
HOW IT WORKS

Three pillars. One install.

Pillar 1 · Observe

See where every dollar goes

Every LLM request flows through RelayPlane. Cost per request, model used, task type, tokens consumed, latency — all tracked automatically. Your first "aha" moment: seeing that 73% of your spend is on the wrong model.

  • Cost per request
  • Model breakdown
  • Task classification
  • Latency tracking
Pillar 2 · Govern

Route to the cheapest model that works

A policy engine classifies each task and routes it to the most cost-effective model. Simple file reads go to Haiku ($0.25/M tokens). Complex architecture goes to Opus ($15/M tokens). You set the rules — or use our defaults.

  • Task classification
  • Policy rules
  • Model fallbacks
  • Custom overrides
Pillar 3 · Learn

Every agent makes yours smarter

Coming soon: opt in to the collective mesh. Share anonymized routing outcomes with other agents on the network. As the mesh grows, routing gets smarter for everyone — automatically.

  • Collective intelligence
  • Model effectiveness scores
  • Failure pattern detection
  • Network effect
GET STARTED

One command. Three minutes.

npm install -g @relayplane/proxy && relayplane init

Built for OpenClaw. Works with any agent framework.
No config files. No BASE_URL changes. No risk — if RelayPlane goes down, your agent keeps working.
Run relayplane stats in your terminal for a quick cost summary.

Step 1
Install
npm install
Step 2
Start proxy
relayplane start
Step 3
Open dashboard
localhost:4100
Step 4
See savings
First hour

Supports: Anthropic · OpenAI · Google Gemini · xAI/Grok · OpenRouter · DeepSeek · Groq · Mistral · Together · Fireworks · Perplexity

ZERO RISK

If RelayPlane crashes,
your agent doesn't notice.

We learned this the hard way. Early versions hijacked provider URLs — one crash took everything down for 8 hours. Never again.

RelayPlane uses a circuit breaker architecture. After 3 failures, all traffic bypasses the proxy automatically. Your agent talks directly to the provider. When RelayPlane recovers, traffic resumes. No manual intervention.

Closed
Normal operation
Open
After 3 failures
Half-Open
Probe recovery
Closed
Recovered
THE NETWORK

Routing that gets smarter
from every user.

COMING SOON

When you opt into the mesh, RelayPlane will report anonymized routing outcomes: "task type X + model Y = success/failure." No prompts. No code. No user data. Just operational signals. The data below is projected, not live.

Routing
Claude Haiku handles file read tasks with 99.2% success rate across 12,847 routing decisions · avg cost $0.002/request
confidence: 0.97 · 12,847 data points
Warning
GPT-4o-mini drops below 80% accuracy for structured JSON output above 30K context tokens
1,203 data points · detected 3 days ago
Pattern
Teams that route code review to Sonnet instead of Opus save 68% with no measurable quality difference
847 teams · 94% confidence

🔒 Never shared. Ever.

Your prompts or conversationsAPI keys or credentialsFile contents or codePersonal informationBusiness logic or dataBehavioral metadata
PRICING

Free to start. Pays for itself in hours.

Free
$0
forever
Everything that runs on YOUR machine. No artificial limits.
  • Full proxy — unlimited requests
  • Local cost dashboard
  • Smart routing & policy engine
  • All 11 providers supported
  • 7-day local history
  • Circuit breaker safety
  • Open source · MIT license
Install Free
Solo devs get everything. No gates. No trials.
Pro
$25/mo
coming soon
Private team mesh — the control plane for teams.
  • Everything in Contributor
  • Shared team dashboard
  • Governance: per-agent/user spend limits
  • Approval flows ("junior devs can't use Opus")
  • Audit logs & 90-day history
  • Multi-seat & org-wide price caps
  • Priority support
For teams who need governance, not solo devs who need features
FeatureFreeContributorPro
Full proxy (unlimited)
Local dashboard
All 11 providers
Smart routing & policies
Circuit breaker
7-day local history90 days
Mesh opt-in (coming soon)
Shared team dashboard
Governance & approval flows
Spend limits per agent/user
Audit logs & multi-seat
THE DATA

Built on pain we measured.

211K
OpenClaw ecosystem
GitHub stars
582K
Views on the
API cost crisis
$35–720
Monthly user
spend range
77%
Cost reduction
documented
"I was mass spending $200+/month running an agent swarm and had zero visibility into where the money was going. Turns out 73% of my requests were using Opus for tasks Haiku could handle."— Matt Turley, Continuum
FAQ

Common questions

Will this break my OpenClaw setup?

No. RelayPlane uses a circuit breaker architecture. If the proxy fails for any reason, all traffic automatically bypasses it and goes directly to your LLM provider. Your agent doesn't even notice. If RelayPlane can't route, it passes through to your default model. Worst case: you pay what you would have paid anyway. We learned this lesson the hard way and built the safety model first.

What data do you collect?

Only anonymized metadata: task type, token count, model used, success/fail. Never your prompts, code, or outputs. Your actual prompts never leave your machine. On the free tier, nothing leaves your machine at all — everything runs locally. If you opt into the Contributor tier, only these anonymized routing signals are shared.

How does smart routing actually work?

RelayPlane classifies each request by task complexity (simple, medium, complex) and routes it to the cheapest model that can handle it. File reads and simple edits go to Haiku. Code generation goes to Sonnet. Complex architecture and reasoning stays on Opus. You can customize the rules or use our defaults.

Does it work with models other than Anthropic?

Yes. RelayPlane supports any OpenAI-compatible API — Claude, GPT-4o, Gemini, Mistral, open-source models via OpenRouter. The routing engine is model-agnostic.

How much will I actually save?

It depends on your usage pattern, but most users see 40-70% cost reduction. The biggest savings come from routing simple tasks (which are typically 60-70% of all requests) to cheaper models. You'll see exact numbers in your dashboard within the first hour.

Is this just a proxy? How is it different from OpenRouter?

OpenRouter routes your requests — that's it. RelayPlane observes (full cost dashboard), governs (policy engine with rules you control), and learns (collective intelligence from the mesh network). Smart routing based on collective data is coming soon — for now, you get full cost visibility and the tools to optimize manually.

What happens if I stop using it?

Nothing. Remove the proxy, your agents talk directly to providers again. No lock-in, no migration, no data hostage. It's MIT licensed — you can fork it and run your own if you want.

What if a cheaper model returns bad results?

RelayPlane automatically retries with a better model. You pay for both calls, but that's still cheaper than always using Opus. Collective failure learning is on the roadmap — for now, the proxy logs these so you can adjust your routing rules.

What if RelayPlane shuts down?

The local proxy works forever — it's MIT licensed software on your machine. Only the mesh network would stop. You'd keep ~30% savings on static rules.

How do I know I'm actually saving money?

Run relayplane stats or check the dashboard. We show you exactly how much you've saved vs. what you would have spent.

Stop paying for your agent's mistakes.

Install RelayPlane. See your costs in 3 minutes.

npm install -g @relayplane/proxy && relayplane init