RelayPlane is an npm-native Node.js LLM proxy. Install with npm install @relayplane/proxy, point your ANTHROPIC_BASE_URL or OPENAI_BASE_URL at http://localhost:4100, and get per-request cost tracking, complexity-based model routing, and Ollama local fallback, no Docker required.

How does RelayPlane compare to LiteLLM?

LiteLLM is a Python library with a proxy server option. RelayPlane is npm-native for Node.js developers, no Python, no Docker, just npm install. RelayPlane has per-request cost tracking built in and integrates directly with Claude Code, Cursor, and OpenClaw via a simple base URL override.

How does RelayPlane compare to Portkey or Helicone?

Portkey and Helicone offer cloud observability and are npm-compatible. RelayPlane runs entirely locally (no cloud required), is open source under MIT, and focuses on cost tracking plus complexity-based routing for Node.js developers using Claude Code, Cursor, and OpenClaw.

Does RelayPlane work with Claude Code and Cursor?

Yes. RelayPlane is designed for Claude Code, Cursor, and any tool that supports ANTHROPIC_BASE_URL or OPENAI_BASE_URL. Set ANTHROPIC_BASE_URL=http://localhost:4100 and RelayPlane intercepts every request, tracks costs, and routes to cheaper models for simple tasks.

Yes, completely. RelayPlane is free and open source (MIT). Every feature is included: full proxy, local and cloud dashboard, all 11 providers, budget enforcement, anomaly detection, hard cost caps, auto-kill, and 90-day history. It runs locally on your machine, your keys and prompts never leave your box. No tiers, no paywalls, no credit card. See relayplane.com/pricing.

How do I install the RelayPlane npm LLM proxy?

Run: npm install -g @relayplane/proxy. Then: relayplane init && relayplane start. The local dashboard runs at http://localhost:4100. Set ANTHROPIC_BASE_URL=http://localhost:4100 in your environment and all Claude requests route through RelayPlane automatically.

How to Cut OpenClaw API Costs with RelayPlane Proxy

If you're running OpenClaw agents, you already know the drill: your API bill climbs every month, and most of the spend is wasteful. The average OpenClaw setup burns $100-300/mo on API calls, and the majority of those requests don't need an expensive model. Here's how to fix that.

What Is an AI Proxy (And Why Should You Care)?

An AI proxy sits between your application and your AI providers. Every API call passes through it. Instead of blindly forwarding requests to whatever model you configured, the proxy inspects the task, classifies its complexity, and routes it to the right model.

This isn't a load balancer. It's a cost intelligence layer.

When OpenClaw sends a request to ANTHROPIC_BASE_URL or OPENAI_BASE_URL, the proxy intercepts it. Simple task like reading a config file? Route to Haiku or GPT-4o-mini at a fraction of the cost. Complex debugging session? Send it to Opus. The routing is rule-based and deterministic, you define the complexity mappings, the proxy enforces them on every request.

Cost Breakdown: Before vs. After Proxy Routing

Here's a typical OpenClaw agent workload breakdown:

Task Type	% of Requests	Examples	Without Proxy	With Proxy
Simple	~60%	File reads, status checks, formatting, short rewrites	Opus/GPT-4o ($$$)	Haiku/GPT-4o-mini ($)
Medium	~25%	Code review, summaries, data extraction	Opus/GPT-4o ($$$)	Sonnet/GPT-4o ($$)
Complex	~15%	Architecture decisions, hard debugging, long-form creative	Opus/GPT-4o ($$$)	Opus/GPT-4o ($$$)

The math: if you're spending $200/mo, roughly $120 of that is simple tasks hitting expensive models. Route those to Haiku-class models and that $120 drops to about $10-15. Medium tasks go from $50 to $30. Complex tasks stay the same. $200/mo becomes $55-75/mo. Your actual savings depend on your workload mix.

Setting Up RelayPlane as Your OpenClaw Proxy

This takes about 5 minutes. Four steps.

Install

$npm install -g @relayplane/proxy

Requires Node.js 18+. That's the only dependency.

Initialize Configuration

$relayplane init

This creates your config file with sensible defaults. It'll ask which providers you use and generate the routing rules. The default setup uses complexity-based routing with three tiers:

routing:
  mode: complexity
  models:
    simple: anthropic/claude-3-haiku
    medium: anthropic/claude-3.5-sonnet
    complex: anthropic/claude-3-opus
  cascade:
    enabled: true
    strategy: cost  # try cheapest first, fall back on failure

Start the Proxy

$relayplane start

The proxy starts on localhost:4100 by default. Now point OpenClaw at the proxy by setting these environment variables:

$export ANTHROPIC_BASE_URL=http://localhost:4100

$export OPENAI_BASE_URL=http://localhost:4100

Verify It's Working

$relayplane stats

This shows you a breakdown of requests by model, provider, cost, and routing decisions. Run a few OpenClaw tasks and check the stats to confirm requests are being routed to different models based on complexity.

Advanced: Multi-Provider Routing

The real power shows up when you route different task types to entirely different providers:

routing:
  mode: complexity
  models:
    simple: groq/llama-3-8b        # blazing fast, nearly free
    medium: anthropic/claude-3.5-sonnet  # strong all-rounder
    complex: openai/gpt-4-turbo         # heavy reasoning
  cascade:
    enabled: true
    strategy: cost
    fallback:
      - anthropic/claude-3-opus
      - openai/gpt-4o

Set budget caps with the budget command:

$relayplane budget --daily 10 --action warn

$relayplane budget --hourly 2 --action block

Budget enforcement supports four actions: block (hard stop), warn (log and continue), downgrade (force cheaper model), and alert (notification). RelayPlane also includes anomaly detection that catches cost spikes, token explosions, and runaway loops automatically.

Privacy and Telemetry

RelayPlane runs entirely on your machine. Your prompts, responses, and API keys never leave localhost. There is no cloud component in the free tier, the proxy is a local process, the dashboard is a local web server.

Telemetry is on by default. It sends anonymous aggregate data: token counts, latency measurements, and model usage patterns. No prompt content, no API keys, no personally identifiable information.

$relayplane telemetry # see current status

$relayplane telemetry off # disable completely

Results and Next Steps

Once the proxy is running, check your dashboard after a day or two of normal usage. Typical results: 40-70% cost reduction depending on your workload mix. Agents that do a lot of simple tasks see the highest savings.

Install and run the proxy, takes 5 minutes, free tier has no limits on requests

Check the cost calculator, plug in your current usage to estimate savings

Tune your routing, after a few days, review stats and adjust complexity model mappings

Set budgets, configure daily and hourly limits so costs can never run away

MIT licensed, free to use with unlimited requests

Free tier includes cost tracking, local dashboard, routing, caching, and 7-day history

Zero code changes to your OpenClaw setup, just one environment variable

Falls back to direct API if the proxy has any issues

Start saving in 30 seconds

Free tier. No credit card. Unlimited requests. Your API keys stay on your machine.

$npm install -g @relayplane/proxy