Quick Start
Get the proxy running in under 60 seconds. Node.js 18+ required.
Claude Code / Max Plan
No API key needed. Your subscription token passes through automatically.
Step 1, Install and start the proxy
1npm install -g @relayplane/proxy2relayplane startStep 2, Point Claude Code at the proxy
Run this in the same terminal you'll use for Claude Code. Or add it to ~/.zshrc / ~/.bashrc so it's always set.
1export ANTHROPIC_BASE_URL=http://localhost:4100Common issue: If the dashboard stays empty, this env var isn't set in the terminal where you run
claude. Run echo $ANTHROPIC_BASE_URL, it should print http://localhost:4100. If blank, re-run the export above.Step 3, Start Claude Code
1claudeRequests flow through the proxy automatically. Open http://localhost:4100 to see cost breakdowns.
API Key Users
1npm install -g @relayplane/proxy1export ANTHROPIC_API_KEY=sk-ant-...2# Optional: OpenAI, Gemini, xAI, etc.1relayplane start1export ANTHROPIC_BASE_URL=http://localhost:4100Check your costs
1relayplane statsView usage and cost breakdown by model. Dashboard also available at http://localhost:4100.
That's it. Simple tasks route to cheaper models automatically. Complex reasoning stays on Opus.
Reliability built in: If a pooled credential returns 2 consecutive 401s (auth failure), the proxy auto-quarantines it for 1 hour and falls back to the next credential in the pool. See fallback docs for details on credential quarantine behavior.
Optional: smarter model routing. You can run a local semantic classifier sidecar (ModernBERT-base) so the proxy routes by task type rather than regex patterns. See Semantic Routing for setup.
Next Steps
- Set up with OpenClaw
- CLI Reference
- Telemetry, off by default, nothing leaves your network
- Semantic Routing, optional local classifier for higher-accuracy task routing