relay-ai
Relay any model into any coding agent — launch tools, switch providers, and run local API gateways.
Watch the Demos:
- Demo (Part 1): Installation · Configuration · Claude Code · Claude Cowork & Claude Code Desktop — watch on YouTube
- Demo (Part 2): OpenAI Codex CLI & Codex Desktop App — watch on YouTube
relay-ai is an interactive CLI that launches AI coding tools and runs local API gateways on your machine. Currently, it supports Claude Code, Claude Desktop (Cowork + Code), the OpenAI Codex CLI, the Codex desktop app (macOS + Windows), and the Google Gemini CLI.
Pick your backend:
- Your providers — configure once with
relay-ai providers(Groq, Mistral, Nvidia, DeepSeek, custom OpenAI/Anthropic endpoints, and more) - OpenCode Zen / Go — cloud models with your OpenCode API key (optional; add via
relay-ai providers) - One-time OpenCode import — bring existing OpenCode provider settings into the registry (
relay-ai providers import) - Google Vertex AI — Claude on Vertex via
relay-ai server --vertexand local gcloud credentials (no OpenCode key required)
Commands
| Command | Description |
|---|---|
relay-ai |
Print help (does not launch Claude Code) |
relay-ai claude |
Pick a provider → launch Claude Code |
relay-ai providers |
Add, import, list, remove, and refresh your AI providers |
relay-ai models |
Manage favorite models for mid-session /model switching |
relay-ai server |
Foreground API gateway (registry providers + optional Zen/Go) |
relay-ai server --vertex |
Foreground Anthropic-compatible gateway to Claude on Vertex AI |
relay-ai claude-app |
Launch Claude Desktop app with registry providers (guide) |
relay-ai codex |
Launch OpenAI Codex CLI with registry providers (guide) |
relay-ai codex-app |
Launch Codex desktop app with registry providers (guide) |
relay-ai gemini |
Launch Google Gemini CLI with registry providers |
relay-ai --ai |
Full agent reference for scripts and alef-agent (guide) |
Features
- Native provider registry:
relay-ai providersstores config in~/.relay-ai/providers.jsonand secrets in the OS keychain — no OpenCode binary required at launch. See docs/PROVIDERS.md for a full list of providers and known issues. - Provider templates: Add Groq, Mistral, Together, OpenRouter, and 15+ SDK-backed providers, plus custom OpenAI/Anthropic-compatible endpoints
- OpenCode import: One-time migration from OpenCode (
providers import); validates API keys and skips placeholders likeanything - OpenCode Zen / Go: Optional cloud backends when you have an OpenCode API key
- SDK adapter proxy: Non-Anthropic providers route through the Vercel AI SDK (same packages OpenCode uses), so Claude Code still speaks Anthropic format. Labeled
(via proxy)in the picker - Favorite models: Save up to 20 and switch mid-session with Claude Code's
/modelcommand - Smart model pickers: Recent models per provider, search for large lists (>25), paginated browse (15 per page)
- Refresh model lists:
relay-ai providers refresh-modelsupdates cached catalogs per provider - API server: Run a local gateway on port 17645 for Claude Code, Claude Desktop, or any Anthropic-compatible client
- Server wizard: Filter exposed providers, mask discovery ids for Claude Desktop, optional favorites-only catalog, local vs network listen mode
- Vertex gateway: Anthropic-compatible Claude on Google Vertex AI using gcloud Application Default Credentials
- Clean environment isolation: We strip 17 conflicting env vars (Vertex AI, Bedrock, AWS, Foundry, stale Anthropic config) from the child process only. We never touch
~/.claude/settings.json(see caveat below) - Secure key storage: Per-provider keys and the OpenCode API key go in the OS credential store (macOS Keychain, Windows Credential Manager, Linux Secret Service) or your shell profile
- Cross-platform: macOS, Windows, Linux (Ubuntu, Fedora, distros with GNOME Keyring or KWallet)
- Dry run mode: Walk through the full wizard and preview the launch command without starting anything
- Preference memory: Last provider and model are pre-selected next time
- Agent / headless launch: Boot flags (
--provider,--model), clean NDJSON/JSONL stdout for alef-agent, andrelay-ai --aireference — see docs/AI-AGENTS.md
Supported tools
| Tool | Command | Status |
|---|---|---|
| Provider registry | relay-ai providers |
Supported (guide) |
| Claude Code | relay-ai claude |
Supported |
| Favorite models | relay-ai models |
Supported |
| OpenCode API server | relay-ai server |
Supported |
| Vertex API gateway | relay-ai server --vertex |
Supported |
| Claude Desktop (Cowork + Code) | relay-ai claude-app |
Supported macOS + Windows (guide) |
| Codex CLI | relay-ai codex |
Supported (guide) |
| Codex desktop app | relay-ai codex-app |
Supported macOS + Windows (guide) |
| Google Gemini CLI | relay-ai gemini |
Experimental, model switching is done via .model prompt |
Prerequisites
- Node.js 18+
- A supported AI coding tool installed (e.g. Claude Code, OpenAI Codex, or Google Gemini CLI)
- At least one provider configured via
relay-ai providers addorimport— or an OpenCode API key for Zen/Go cloud backends - OpenCode CLI only if you want one-time import from an existing OpenCode setup (optional)
- For Vertex gateway: Google Cloud SDK with
gcloud auth application-default login, a GCP project with Vertex AI enabled, and Claude partner models enabled in that project
A note on providers: relay-ai keeps your provider list in ~/.relay-ai/providers.json. You can add providers directly (API key + template), import from OpenCode once, or use Zen/Go cloud backends. OpenCode is not required after setup.
Installation
To install the CLI globally:
npm install -g @jacobbd/relay-ai
Upgrading
To upgrade to the latest version:
npm update -g @jacobbd/relay-ai
Uninstallation
To uninstall the CLI globally:
npm uninstall -g @jacobbd/relay-ai
If you use a Node version manager like NVM, make sure you run the uninstall command using the active Node version that was used to install it (e.g., run
nvm use <version>first).
To fully remove the tool and all its configuration data, you can delete the configuration directory (.relay-ai) on your operating system:
- macOS / Linux:
rm -rf ~/.relay-ai - Windows:
- In Command Prompt:
rmdir /s /q "%USERPROFILE%\.relay-ai" - In PowerShell:
Remove-Item -Recurse -Force "$env:USERPROFILE\.relay-ai"
- In Command Prompt:
Setup
Configure providers
relay-ai providers # hub: add, import, list, refresh models
relay-ai providers add # pick a template or custom endpoint
relay-ai providers import # one-time migration from OpenCode (optional)
On first relay-ai claude run with an empty registry, an inline wizard walks you through Quick start (Zen), import, or opening relay-ai providers.
OpenCode API key (Zen/Go only)
Grab your key at opencode.ai/auth if you use OpenCode Zen or Go (skip for registry-only or Vertex setups).
| Platform | Secure storage | Plaintext fallback |
|---|---|---|
| macOS | Keychain (optional: + ~/.zshrc auto-load) |
Shell profile |
| Windows | Credential Manager | setx user env var |
| Linux (desktop) | Secret Service (GNOME Keyring / KWallet) | Shell profile |
| Linux (headless) | n/a | Shell profile |
The key is active in your current session right away, no matter which option you pick. No terminal restart needed.
Usage
Launch Claude Code
relay-ai claude
First run: pick a provider from your registry (or complete the inline setup wizard). If you've added OpenCode Zen/Go, those appear alongside registry providers like Groq, Nvidia, or DeepSeek.
Favorite models and mid-session switching
Save the models you bounce between:
relay-ai models
Add up to 20 favorites from Zen, Go, or any OpenCode-configured provider. When you have favorites, relay-ai claude starts a multi-route proxy automatically. Claude Code's /model command lists your starting model plus favorites. Switch live, no restart.
No favorites? Launch works like before: single model, no switch menu. --dry-run ignores saved favorites so you can preview a single-model launch.
relay-ai claude options
| Flag | Description |
|---|---|
--dry-run |
Run the full wizard but preview the launch command instead of executing |
--setup |
Reminder to use relay-ai providers for provider setup |
--trace |
Write debug logs to ~/.relay-ai/logs/ and show errors on exit |
--help |
Show command help |
--version |
Show version |
relay-ai claude --dry-run
relay-ai claude --setup
relay-ai claude --trace
Claude Code flags and session IDs pass through unchanged:
relay-ai claude -c
relay-ai claude --resume abc-123
relay-ai claude abc-123
Non-interactive / agent launch — skip the wizard with boot flags:
relay-ai claude --provider groq --model llama-3.3-70b-versatile -p "Summarize README.md"
relay-ai claude --model zen__deepseek-v4-flash-free -p "task" --output-format stream-json
| Flag | Description |
|---|---|
--provider |
Boot provider id (skip wizard with --model or in print mode) |
--model |
Boot model id, or slug provider__model-id |
For alef-agent, NDJSON streaming, Codex exec --json, and sandbox defaults, see docs/AI-AGENTS.md and run relay-ai --ai.
Use -- when you want every following token passed directly to Claude Code:
relay-ai claude -- --print "hello"
relay-ai claude -- --dangerously-skip-permissions
relay-ai claude --dry-run -- --print "test"
Server mode
Run relay-ai as a foreground API gateway on port 17645:
| Mode | Command | Auth | Models |
|---|---|---|---|
| Registry gateway | relay-ai server |
Per-provider keys in registry (+ OpenCode key for Zen/Go if exposed) | Providers you configured |
| Vertex gateway | relay-ai server --vertex |
gcloud Application Default Credentials | Claude on Vertex AI |
Claude Desktop (Cowork + Code): For the automated macOS/Windows setup, use
relay-ai claude-app. For manual or network setups, see docs/CLAUDE_DESKTOP_SETUP.md.
Registry gateway (relay-ai server)
Works with any providers in your registry. Zen/Go models appear when you have an OpenCode API key and those providers are exposed.
The wizard asks:
| Prompt | What it does |
|---|---|
| Configure & start vs Start with saved settings | Full wizard or reuse saved server preferences |
| Exposed providers | Limit which providers appear in the catalog (Zen, Go, Groq, OpenAI, etc.) |
| Mask gateway model ids for discovery? | Recommended Yes for Claude Desktop — hides competitor vendor strings in model ids so discovery works |
| Expose only favorite models? | Optional cap at your favorites (manage with relay-ai models) |
| Listen mode | Local only (127.0.0.1) or Network (0.0.0.0 + server password) |
Local mode — point any Anthropic-compatible client at your machine:
export ANTHROPIC_BASE_URL="http://127.0.0.1:17645/anthropic"
export ANTHROPIC_API_KEY="anything"
Network mode — other devices on your LAN:
export ANTHROPIC_BASE_URL="http://<server-ip>:17645/anthropic"
export ANTHROPIC_API_KEY="<server-password>"
By default the server password stays in memory only. If you choose to save it, relay-ai stores it in ~/.relay-ai/config.json.
OpenAI-format models also get an OpenAI-compatible endpoint:
export OPENAI_BASE_URL="http://127.0.0.1:17645/openai/v1"
export OPENAI_API_KEY="anything"
Health check:
curl -s http://127.0.0.1:17645/health
curl -s http://127.0.0.1:17645/anthropic/v1/models | head
The spinner reports how many models loaded and how many came from registry providers.
Vertex gateway (relay-ai server --vertex)
Anthropic-compatible gateway to Claude on Google Vertex AI. No OpenCode API key required.
Setup:
gcloud auth application-default login
export ANTHROPIC_VERTEX_PROJECT_ID="your-gcp-project" # or GOOGLE_CLOUD_PROJECT
export GOOGLE_CLOUD_LOCATION="global" # optional; default: global
relay-ai server --vertex
Default models: claude-sonnet-4-6, claude-opus-4-6, claude-haiku-4-5
Shorthand aliases (for Claude Code /model and settings.json): sonnet, opus, haiku. Append [1m] for 1M context on Sonnet and Opus only (Haiku stays 200k).
Custom catalog: copy assets/vertex-models.example.json to ~/.relay-ai/vertex-models.json and edit. Override the config directory with RELAY_AI_HOME.
When the gateway is running:
export ANTHROPIC_BASE_URL="http://127.0.0.1:17645/anthropic"
export ANTHROPIC_API_KEY="anything"
Claude Code tip: When routing through the gateway, unset native Vertex env vars so Claude Code doesn't bypass the proxy:
unset CLAUDE_CODE_USE_VERTEX ANTHROPIC_VERTEX_PROJECT_ID CLOUD_ML_REGION
Codex CLI (relay-ai codex)
Launch OpenAI Codex CLI with registry providers. Requires npm install -g @openai/codex.
relay-ai providers add # Anthropic, xAI, OpenAI, etc.
relay-ai codex # pick provider + model → Codex TUI
Claude Desktop app (relay-ai claude-app)
Launch Claude Desktop (macOS or Windows) with registry providers:
relay-ai claude-app
This command automates the "Third-Party Inference" (Developer Mode) setup. It temporarily configures Claude Desktop to point at a local gateway, launches the app, and routes traffic to your chosen provider.
- Keep the terminal open: The proxy runs in the foreground.
- Ctrl+C to restore: When you're done, press
Ctrl+Cin the terminal to automatically restore Claude Desktop to its normal Anthropic cloud mode. - Cleanup: If the terminal crashes, run
relay-ai claude-app --restore.
For manual network setups (e.g., remote cloud desktop), you can still use relay-ai server. See the full Claude Desktop Setup Guide.
relay-ai writes a temporary profile (~/.codex/relay-ai-launch.config.toml) and removes it when Codex exits. After a crash: relay-ai codex --restore.
Sandbox / network: relay-ai codex defaults to danger-full-access (profile + -s flag) so shell tools like curl, nlm, and npm can reach the network. Override for one session:
relay-ai codex -s workspace-write
Pass Codex flags directly after relay-ai codex — you do not need -- before -s. Codex’s --dangerously-bypass-approvals-and-sandbox also passes through if you need it.
Full details: docs/CODEX.md — CLI + desktop app, configs, restore, sandbox, routing.
For agent / alef-agent integration (boot flags, NDJSON, JSONL): docs/AI-AGENTS.md and relay-ai --ai.
Codex desktop app (relay-ai codex-app)
Launch the Codex app (macOS or Windows) with registry providers:
relay-ai codex-app
Patches ~/.codex/config.toml with backup; Ctrl+C in the relay-ai terminal restores your config. The app keeps Codex's built-in openai provider active so existing conversation history remains visible, and routes the selected model through a foreground local proxy. Preview config without writing: relay-ai codex-app --config. Recovery: relay-ai codex-app --restore.
See docs/CODEX.md for CLI vs app differences, file ownership, and troubleshooting.
Reasoning effort: Capable models show Codex's native reasoning picker (low/medium/high, etc.). relay-ai maps your choice to each provider's SDK options and preserves existing model_reasoning_effort in Codex config. Claude Code /effort and the relay-ai server gateway use the same mapping — see the reasoning section in docs/CODEX.md.
Google Gemini CLI (relay-ai gemini)
Launch the Google Gemini CLI with registry providers.
relay-ai gemini
Pick provider → pick model → Gemini prompt loop opens. Non-interactive tasks with streaming NDJSON are also fully supported:
relay-ai gemini --provider google --model gemini-2.5-flash -p "Review this file" -o stream-json
For agent / alef-agent integration (boot flags, NDJSON): docs/AI-AGENTS.md and relay-ai --ai.
How it works
OpenCode Zen / Go filtering
When OpenCode Zen is in your registry, subscriptionFilter controls which Zen models appear (free = free tier only; default = all Zen models). Add or change Zen via relay-ai providers.
Environment isolation
When you launch, relay-ai builds a clean child environment:
- Removes 17 conflicting env vars from the child process (Vertex AI, Bedrock, AWS, Foundry, stale Anthropic config)
- Sets
ANTHROPIC_BASE_URL,ANTHROPIC_API_KEY, andANTHROPIC_MODELfor the session - Passes
--model <selected>to Claude Code as a backup override
When Claude Code exits (normal exit, Ctrl+C, terminal close), your shell is unchanged. No cleanup step. No restore needed.
Caveat: Claude Code persists the model. relay-ai doesn't edit ~/.claude/settings.json, but Claude Code saves the model you launched with (via --model and ANTHROPIC_MODEL). A later bare claude launch may still show that model, e.g. anthropic-opencode-go__deepseek-v4-flash from a prior relay-ai session. To get back to a first-party default, run claude --model sonnet (or your preferred Claude model), or remove the "model" key from ~/.claude/settings.json. If you used the favorites switch menu, Claude Code may also cache the gateway catalog at ~/.claude/cache/gateway-models.json. Delete that file if /model shows stale entries from a dead proxy.
Model compatibility
OpenCode exposes models through different API formats. relay-ai handles them when it can:
| Model format | Examples | How it works | Label |
|---|---|---|---|
| Anthropic native | Claude, Qwen, MiniMax (Go) | Direct connection | (none) |
| OpenAI chat completions | DeepSeek, Kimi, MiMo, GLM, Grok, GPT-4o (OpenCode OpenAI provider) | SDK adapter proxy (Vercel AI SDK) | via proxy |
| OpenAI Responses API | GPT-5.4+, GPT-5.5, Codex, o-series (OpenCode OpenAI provider only) | Same proxy; SDK picks Responses API | via proxy |
| Gemini native | Gemini (OpenCode Google provider) | SDK adapter, Gemini native API | via proxy |
| Other SDK providers | Cerebras, Perplexity, Bedrock, Vertex, Together AI, etc. | Whatever api.npm OpenCode assigns |
via proxy |
| Not in cloud wizard | GPT, Gemini on OpenCode Zen/Go | Use an OpenCode-configured provider instead (OpenAI/Google in OpenCode config) | not yet supported |
The SDK adapter proxy starts on a random local port for proxy-routed models and stops when Claude Code exits. Each relay-ai claude session gets its own port, so multiple terminals are fine. (relay-ai server uses fixed port 17645. One server instance per machine.)
Provider notes
Mistral (free tier): Rate limits are tight. Expect HTTP 429 during tool-heavy sessions. Claude Code retries with backoff. That's Mistral throttling, not a proxy bug.
OpenAI (OpenCode-configured provider): Configure OpenAI in OpenCode with your API key, then pick the OpenAI provider at launch. Newer GPT models use OpenAI's Responses API. The SDK picks responses vs chat from the model ID. OpenCode catalog IDs can differ from API IDs (e.g. gpt-5.5-fast maps to upstream gpt-5.5). If you see "model not available", run relay-ai claude --trace and check ~/.relay-ai/logs/claude-debug.log.
CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS=1 is set for direct (non-proxy) routes only. Proxy sessions keep tool-search betas.
API key storage
relay-ai uses @napi-rs/keyring for the OS credential store. On later runs it checks silently. Key found? Wizard skips the prompt.
| Platform | Credential store | Notes |
|---|---|---|
| macOS | macOS Keychain | Optional ~/.zshrc auto-load line for system-wide availability |
| Windows | Windows Credential Manager | setx available as plaintext alternative |
| Linux (desktop) | Secret Service API (GNOME Keyring, KWallet) | Needs a running keyring daemon |
| Linux (headless) | Not available | Falls back to shell profile or session-only |
If the native module fails to load, credential store options are skipped and you get shell profile / session-only storage.
Configuration
Provider registry (no secrets in this file):
~/.relay-ai/providers.json
Manage with relay-ai providers. API keys are stored in the OS keychain (keyring:provider:<id>).
App preferences — favorites, last provider/model, server settings, optional server password:
~/.relay-ai/config.json
Override the config directory:
export RELAY_AI_HOME="/path/to/your/relay-ai-home"
The OpenCode API key (for Zen/Go) and per-provider keys are stored separately, based on what you chose during setup (Keychain, credential store, or shell profile).
Troubleshooting
See docs/TROUBLESHOOTING.md for common issues — especially “Not logged in” after accidentally choosing No on Claude Code’s custom API key prompt.
Upgrading from opencode-starter
If you used the old opencode-starter CLI, relay-ai migrates automatically on first run:
- Config moves from
~/.opencode-starter/→~/.relay-ai/ - Legacy Keychain / credential-store entries are read and re-saved under
relay-ai - The CLI command is now
relay-ai(notopencode-starter) - Launch Claude Code with
relay-ai claude(barerelay-aiprints help)
The deprecated OPENCODE_STARTER_HOME env var still works as a fallback for RELAY_AI_HOME.
Contributing
Private beta right now. Issues and PRs welcome on GitHub.
Disclaimer
This project and its creator have no affiliation with OpenCode, Anthropic, Claude, Google, or any other vendor named or integrated here. Trademarks belong to their respective owners.
relay-ai was built for education and research, and mostly for fun. It routes inference through services you configure yourself (OpenCode Zen/Go, OpenCode-configured providers, Vertex AI, and gateways you run locally). Use at your own risk.
Vibe Coding Alert
Full transparency: this project was vibe coded with AI coding assistants. If you're an experienced developer, you might look at parts of this codebase and wince. That's okay.
The goal was to scratch an itch: launch Claude Code and Claude Desktop (Cowork + Code) against OpenCode backends and Vertex without fighting env vars, proxies, and model discovery. The code works. It's not corporate polish.
If something makes you cringe, open an issue or PR. Human expertise is irreplaceable. For the tone and spirit of this section, see notebooklm-mcp-cli on the same GitHub org.
License
MIT