RelayPlane vs Kong AI Gateway

Kong AI Gateway is an enterprise LLM routing layer built on top of Kong's API platform. It requires Docker or Kubernetes and targets ops teams already running Kong. RelayPlane is npm-native, local-first, and built for LLM cost intelligence. No Docker. No YAML. 30-second setup.

TL;DR

Choose RelayPlane if you want:

  • npm install in 30 seconds, no Docker or Kubernetes required
  • Per-request USD cost tracking stored locally in SQLite
  • Per-agent cost attribution with runaway loop detection
  • Dynamic cost-optimized routing by task complexity
  • Works with Claude Code, Cursor, and any OpenAI-compatible tool

Kong AI Gateway may fit if:

  • Your team already runs Kong for API management and wants to add LLM routing
  • You need enterprise-grade rate limiting, semantic caching, and AI audit logs via Kong plugins
  • You have a platform engineering team to manage container infrastructure

Feature Comparison

Installation method
  RelayPlane: npm install -g @relayplane/proxy
  Kong AI Gateway: docker run kong/kong-gateway + YAML config

RelayPlane is a single npm command. Kong AI Gateway requires Docker (or Kubernetes), a running container, and a declarative kong.yml configuration file before any LLM request can be routed.

Setup time
  RelayPlane: ~30 seconds
  Kong AI Gateway: 30+ minutes (Docker, kong.yml, plugin config)

RelayPlane runs on localhost:4100 immediately after install. Kong requires pulling a Docker image, writing a kong.yml routes file, enabling AI plugins, and validating the configuration.

Docker or Kubernetes required
  RelayPlane: No (Node.js and npm only)
  Kong AI Gateway: Yes (Docker image; commonly Kubernetes in production)

RelayPlane needs only Node.js and npm. Kong AI Gateway is distributed as a Docker image and is commonly deployed on Kubernetes in production, requiring container orchestration infrastructure.

Primary audience
  RelayPlane: Individual developers and agent builders
  Kong AI Gateway: Enterprise ops teams already running Kong

RelayPlane targets developers building AI agents who want cost visibility in minutes. Kong AI Gateway is designed for enterprise platform teams that already operate Kong for API management at scale.

Configuration style
  RelayPlane: CLI flags and local config file
  Kong AI Gateway: Declarative YAML (kong.yml) or Kong Admin API

RelayPlane is configured with a simple local config. Kong uses a declarative YAML file with services, routes, and plugin definitions, or the Kong Admin REST API for dynamic configuration.

Per-request USD cost tracking
  RelayPlane: Yes (live pricing tables, stored locally)
  Kong AI Gateway: No (tracks requests and tokens, not dollar costs)

RelayPlane computes exact USD cost per request using live pricing tables and stores results locally. Kong AI Gateway tracks requests and tokens but does not compute or attribute dollar costs per request.

Per-agent cost attribution
  RelayPlane: Yes (system-prompt fingerprinting)
  Kong AI Gateway: No

RelayPlane fingerprints system prompts to attribute cost and usage to individual agents. Kong has no concept of AI agents or per-agent spend attribution.

Cost dashboard
  RelayPlane: Built-in local dashboard
  Kong AI Gateway: Metrics via plugins (Prometheus, Datadog); no LLM cost dashboard

RelayPlane provides a local cost dashboard showing spend by model, agent, and time period. Kong AI Gateway surfaces metrics through Kong's plugin system (Prometheus, Datadog) but has no built-in LLM cost dashboard.

Smart routing / cost-optimized routing
  RelayPlane: Dynamic routing by task complexity and cost
  Kong AI Gateway: Static load balancing across configured upstreams

RelayPlane dynamically routes by task complexity and cost, sending simple tasks to cheaper models. Kong supports load balancing across upstreams but does not route by LLM task complexity or cost.

Semantic caching
  RelayPlane: Not currently available
  Kong AI Gateway: Available via Kong AI Semantic Cache plugin

Kong AI Gateway offers a semantic caching plugin that can return cached responses for similar queries. RelayPlane does not currently include semantic caching.

Rate limiting
  RelayPlane: Basic request throttling
  Kong AI Gateway: Advanced rate limiting with Kong Rate Limiting plugin

Kong has a mature, production-grade rate limiting plugin with fine-grained controls built on years of API gateway experience. RelayPlane provides basic throttling.

Multi-provider LLM support
  Both: OpenAI, Anthropic, Cohere, Azure OpenAI

Both tools support multiple LLM providers. RelayPlane uses OpenAI-compatible routing across providers; Kong AI Gateway configures providers through declarative plugin definitions in kong.yml.

AI audit logs
  RelayPlane: Local SQLite request log
  Kong AI Gateway: AI Audit Log plugin available

Kong offers an AI Audit Log plugin for compliance use cases. RelayPlane stores request and cost logs locally in SQLite, accessible without additional plugin configuration.

npm-native
  RelayPlane: Yes (standard npm package)
  Kong AI Gateway: No npm package

RelayPlane is a standard npm package that integrates directly into Node.js and TypeScript workflows. Kong has no npm package and is not designed for npm-based development workflows.

Local-first (no infrastructure required)
  RelayPlane: Yes (fully local)
  Kong AI Gateway: No (requires a running container or cloud deployment)

RelayPlane is fully local with no server, container, or cloud dependency. Kong AI Gateway requires a running container or cloud deployment, making local developer use significantly more complex.

Open source
  RelayPlane: Core proxy is open source (MIT)
  Kong AI Gateway: Kong Gateway is open source; AI features may require Kong Konnect

Kong Gateway CE is open source under Apache 2.0. Some Kong AI Gateway features and plugins are available only through Kong Konnect, the enterprise SaaS platform. RelayPlane's proxy core is MIT-licensed.
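For contrast, Kong's declarative setup looks roughly like the minimal kong.yml sketch below, using Kong's ai-proxy plugin. Field names follow Kong's published plugin schema as best recalled here; verify them against Kong's documentation before use:

```yaml
_format_version: "3.0"
services:
  - name: llm-service
    url: http://localhost:32000   # placeholder upstream; ai-proxy rewrites the target
    routes:
      - name: chat-route
        paths:
          - /chat
        plugins:
          - name: ai-proxy
            config:
              route_type: llm/v1/chat
              auth:
                header_name: Authorization
                header_value: Bearer <OPENAI_API_KEY>
              model:
                provider: openai
                name: gpt-4o
```

Even this minimal sketch assumes a running Kong container to load it into, which is the gap RelayPlane's npm-only install is designed to avoid.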

Why Agent Developers Choose RelayPlane

1. npm install in 30 seconds, no Docker or Kubernetes required

RelayPlane is npm install -g @relayplane/proxy && relayplane start. That is it. No Docker image to pull, no kong.yml to write, no plugin configuration, no Admin API calls. Kong AI Gateway requires container infrastructure before a single LLM request can be routed, making it a far heavier setup for a developer who just wants cost visibility.

2. Built for developer cost intelligence, not enterprise API management

Kong is a battle-tested API gateway built for platform engineering teams managing hundreds of microservices. The AI Gateway is an add-on to that platform. RelayPlane was designed from the ground up to answer one question: how much did each AI agent cost, and how can I reduce it? Per-request USD costs, per-agent attribution, and runaway loop detection are core features, not plugins.
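The per-agent attribution technique described above, fingerprinting system prompts, can be sketched in a few lines. This is a toy illustration of the approach, not RelayPlane's implementation; the function names are invented:

```typescript
import { createHash } from "node:crypto";

// Stable fingerprint: identical system prompts always map to the same agent ID.
function agentFingerprint(systemPrompt: string): string {
  return createHash("sha256").update(systemPrompt).digest("hex").slice(0, 12);
}

// Bucket per-request USD costs under each agent's fingerprint.
const spendByAgent = new Map<string, number>();

function recordCost(systemPrompt: string, usd: number): void {
  const id = agentFingerprint(systemPrompt);
  spendByAgent.set(id, (spendByAgent.get(id) ?? 0) + usd);
}
```

With costs bucketed this way, a runaway loop shows up as one fingerprint accumulating spend far faster than its peers.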

3. Dynamic cost-optimized routing, not static upstream configuration

Kong routes LLM requests to configured upstreams using load balancing rules you define in kong.yml. RelayPlane routes dynamically: simple tasks go to Haiku, complex tasks go to Opus, and the routing decision is based on live cost and complexity signals. You get cost reduction without manually tuning YAML configuration files.
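The idea behind complexity-based routing can be sketched in a few lines. Everything here, the model names, the threshold, and the regex, is an illustrative assumption, not RelayPlane's actual heuristic:

```typescript
// Toy complexity router: cheap model for simple prompts, strong model otherwise.
type Model = "claude-3-5-haiku" | "claude-3-opus";

function routeByComplexity(prompt: string): Model {
  // Crude complexity signal: long prompts or code-like content get the
  // stronger (more expensive) model; short chatty prompts get the cheap one.
  const looksComplex = prompt.length > 500 || /```|function|class /.test(prompt);
  return looksComplex ? "claude-3-opus" : "claude-3-5-haiku";
}

console.log(routeByComplexity("Summarize this sentence.")); // short prompt → cheap model
```

A production router would use richer signals (token counts, tool use, past failure rates), but the shape is the same: the routing decision is computed per request rather than fixed in a config file.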

4. Works with Claude Code, Cursor, and any OpenAI-compatible tool in seconds

RelayPlane is a drop-in localhost proxy on port 4100. Point any OpenAI-compatible tool at http://localhost:4100 and cost tracking starts immediately. Routing Claude Code or Cursor through Kong AI Gateway requires Docker, a running container, network configuration, and plugin setup that is foreign to the typical agent developer workflow.
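In practice, pointing a tool at the proxy usually amounts to overriding the OpenAI base URL. A sketch, assuming RelayPlane exposes the OpenAI-compatible API under /v1 (check the RelayPlane docs for the exact path; the env var names below are the de-facto OpenAI SDK conventions, and individual tools may have their own override settings):

```shell
# Route an OpenAI-compatible tool through the local RelayPlane proxy.
export OPENAI_BASE_URL="http://localhost:4100/v1"
export OPENAI_API_KEY="your-provider-key"   # forwarded upstream by the proxy
```

Once set, any SDK or CLI that honors these variables sends its traffic through localhost:4100, and every request is logged and costed locally.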

When Kong AI Gateway is the right fit

Kong is a mature, battle-tested API gateway with years of production use across large enterprise deployments. If your organization already runs Kong for API management, adding the AI Gateway plugin to your existing infrastructure is a natural extension. You get rate limiting, load balancing, semantic caching, and audit logs in the same platform your ops team already operates.

If you are building AI agents in Node.js or TypeScript, using Claude Code or Cursor, and need to understand and control your LLM spend without standing up container infrastructure, Kong is the wrong tool for the job. Its AI Gateway is built for enterprise platform teams, not individual developers. RelayPlane is built from the ground up for the agent developer workflow: one npm command, a localhost proxy, and cost intelligence running in 30 seconds.

Cut your agent costs by 50-80%

No Docker. No Kubernetes. No kong.yml. No container infrastructure to manage. One npm command and your local cost-intelligence proxy is running.

npm install -g @relayplane/proxy && relayplane start