Anthropic Just Made Model Routing a Billing Decision
Starting today, Claude Pro and Max subscriptions no longer cover third-party tools. Here's the real math, three paths forward, and why smart routing is the right answer.
Real data and hard-won lessons on AI agent cost control, model routing, and building with MCP.
As of April 4, 2026, Max and Pro subscribers using Cline, aider, Roo Code, or any non-Anthropic tool now pay per-token API rates on top of their subscription. Here's the math, the credit deadline, and what to do about it.
LLM proxy and LLM gateway sound the same. They're not. Here's the actual difference, a feature comparison, and how to decide which one fits your stack.
LiteLLM PyPI versions 1.82.7 and 1.82.8 were compromised in a supply chain attack affecting 95M+ downloads. If you're rethinking your LLM proxy stack, here are the best alternatives.
The break-even math on Claude Max vs API-only. At what daily usage does $200/mo flat beat pay-per-token? We run the numbers so you can stop guessing.
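That break-even question reduces to one line of arithmetic. A minimal sketch; the $10/MTok blended rate below is an illustrative placeholder, not a real Anthropic price:

```typescript
// Break-even daily token volume: the point where a flat monthly plan
// costs the same as pay-per-token usage. All rates are placeholders.
function breakEvenTokensPerDay(
  flatMonthlyUsd: number, // e.g. 200 for a $200/mo plan
  usdPerMTok: number,     // blended API price per million tokens
  daysPerMonth = 30,
): number {
  return ((flatMonthlyUsd / daysPerMonth) / usdPerMTok) * 1e6;
}

// At a placeholder blended $10/MTok, a $200/mo plan breaks even around
// 667k tokens/day — below that volume, pay-per-token is cheaper.
const tokens = breakEvenTokensPerDay(200, 10);
```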
Agent runaway costs are happening right now to teams shipping AI agents without budget guardrails. Here is why it happens, how to add limits in code, and how to enforce them at the infrastructure layer with RelayPlane.
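The in-code limits that post refers to can be as small as a counter checked before every call. A hypothetical sketch — the class and its API are illustrative, not RelayPlane's:

```typescript
// Minimal in-process budget guard: check an estimated cost before each
// LLM call, refuse calls that would blow the cap, record actuals after.
class BudgetGuard {
  private spentUsd = 0;
  constructor(private capUsd: number) {}

  // Throws before the request is made if it would exceed the cap.
  check(estimatedUsd: number): void {
    if (this.spentUsd + estimatedUsd > this.capUsd) {
      throw new Error(
        `budget cap $${this.capUsd} would be exceeded ` +
          `(spent $${this.spentUsd.toFixed(2)}, next ~$${estimatedUsd})`,
      );
    }
  }

  // Record the real cost once the provider reports usage.
  record(actualUsd: number): void {
    this.spentUsd += actualUsd;
  }
}
```

Infrastructure-layer enforcement does the same bookkeeping outside your process, so a runaway agent can't simply skip the check.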
A comparison of the top npm LLM proxy packages for Node.js developers in 2026: per-request cost tracking, no Docker, native Node.js setup, and honest tradeoffs between RelayPlane, Portkey, Helicone, and llm-proxy.
Autonomous agents make API calls on your behalf with no natural pause point. An AI agent proxy sits between your agents and your LLM provider to classify requests, track spend, and cap it before the bill arrives.
A breakdown of how OpenClaw billing works, which agent operations burn the most tokens, and how to get visibility before costs surprise you.
Starting with v1.9, RelayPlane enables telemetry and mesh by default. Here is what changed, why, and how to opt out.
Route OpenClaw agent calls through a local proxy to cut API costs without changing any application code. Complexity routing, budget controls, and multi-provider setup in 5 minutes.
Set ANTHROPIC_BASE_URL once, get smart model routing, per-request cost tracking, and budget enforcement for free. Here is how a local proxy changes your Claude setup.
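The one-variable setup that post describes looks roughly like this, assuming your client honors ANTHROPIC_BASE_URL (the official Anthropic SDKs read it to override their default endpoint) and that your proxy listens on port 4000 — both the port and the proxy are assumptions:

```typescript
// Route all Anthropic traffic through a local proxy by overriding the
// base URL once. Port 4000 is an assumed local proxy address.
process.env.ANTHROPIC_BASE_URL = "http://localhost:4000";

// From here on, a client constructed from @anthropic-ai/sdk (not
// imported in this sketch) would send requests to the proxy instead of
// api.anthropic.com — no application code changes required.
```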
Most teams build a proxy once, then realize they've built a second product. Here's what the DIY path actually looks like — and the 3-line alternative.
Per-request LLM cost tracking without building an accounting system. Here's what works, what breaks, and how to do it without hardcoded pricing constants.
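A sketch of the shape per-request tracking usually takes: cost is computed from the provider's usage metadata plus a price table injected at runtime (loaded from config or a pricing endpoint), so no rates live in code. All names and numbers here are illustrative:

```typescript
// Per-request cost from token usage and an injected price table.
type ModelPrice = { inputPerMTok: number; outputPerMTok: number };
type Usage = { inputTokens: number; outputTokens: number };

function requestCostUsd(
  model: string,
  usage: Usage,
  prices: Record<string, ModelPrice>, // loaded at runtime, not a constant
): number {
  const p = prices[model];
  if (!p) throw new Error(`no pricing loaded for model "${model}"`);
  return (
    (usage.inputTokens / 1e6) * p.inputPerMTok +
    (usage.outputTokens / 1e6) * p.outputPerMTok
  );
}
```

Failing loudly on an unknown model matters more than it looks: silently charging $0 for a new model is exactly how hardcoded-pricing trackers drift wrong.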
Three production concerns for every Claude deployment: rate limits, cost overruns, and provider fallbacks. Here's the clean solution for each.
OpenRouter, Cloudflare AI Gateway, LiteLLM, and RelayPlane compared. Different tools for different stacks — here is what each one actually delivers in 2026.
Claude Code makes a lot of API calls. Here is where the costs come from, how to track them per session, and how to cut your bill 50-70% with model routing and budget limits.
Four LLM gateways compared: RelayPlane, LiteLLM, Helicone, and Bifrost. Different tools, different tradeoffs, different stacks. Here is an honest breakdown.
The x402 protocol is turning AI agents into economic actors. Here are 6 concrete ways autonomous agents are already earning USDC, ranked by how fast you can actually ship them.
Give your AI agents budget guardrails and multi-provider routing with one MCP server. No more surprise API bills.
Surprised by your first AI API bill? You're not alone. Here's how to estimate LLM API costs upfront, avoid the shock, and instrument your agent properly before you scale.
Running an OpenClaw-based agent and watching the API bill creep up? Here's where the money goes and how to cut it without changing a line of application code.
Most AI projects start with one provider and end up juggling four. Here's how to build a control plane that handles routing, fallback, and budget enforcement without rewriting everything.
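The fallback half of such a control plane fits in a few lines; the provider-call signature below is a hypothetical stand-in for real SDK calls:

```typescript
// Try each provider in order; return the first success, rethrow the
// last failure if every provider is down.
type ProviderCall = (prompt: string) => Promise<string>;

async function completeWithFallback(
  prompt: string,
  providers: ProviderCall[],
): Promise<string> {
  let lastError: unknown = new Error("no providers configured");
  for (const call of providers) {
    try {
      return await call(prompt);
    } catch (err) {
      lastError = err; // remember the failure, move to the next provider
    }
  }
  throw lastError;
}
```

Routing and budget enforcement then become policies applied before this loop picks which providers (and which models) to try.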
I built an AI proxy after a $340 Anthropic bill I never saw coming. Here's what I learned about routing, storage, and building infrastructure tooling in public.
Route Claude Code through a local proxy and stop burning Opus tokens on file reads and git status. Three commands, one environment variable.
Running AI agents without review pipelines is how you end up with live credentials committed to git. Here is the mandatory pipeline setup that lets agents ship safely without a human in the loop.
An open-source LLM proxy that tracks costs, routes by complexity, and stops runaway agent spending. npm install, no Docker, no Python.