Reallocating $100/Month Claude Code Spend to Zed and OpenRouter
TL;DR Highlight
This article shares how a developer, tired of usage limits with the Claude Code Max plan ($100/month), switched to a combination of Zed editor ($10/month) + OpenRouter (pay-as-you-go), gaining credit rollover and freedom in model selection.
Who Should Read
Developers who are frustrated by frequent usage limit issues with their Claude Code Max subscription, or developers who want to optimize the cost of AI coding tools.
Core Mechanics
- The author feels they are hitting the limits faster than before with a combined $100/month for Claude Code and the Claude desktop app. This is a common complaint reported by several people on Reddit and Twitter, including AMD's AI Senior Director.
- The author's usage pattern is 'burst' - using it in concentrated periods - which doesn't efficiently utilize the monthly subscription window. The core complaint is that credits expire even during periods of non-use.
- The proposed alternative is Zed editor ($10/month) + OpenRouter (prepaid credits). A total budget of $100 is allocated with $10 for Zed and $90 charged to OpenRouter each month, consuming only what is used.
- OpenRouter credits are valid for 365 days if unused, allowing unused credits to roll over to the next month. The key advantage is avoiding the loss of credits due to missed reset cycles like with Anthropic subscriptions.
- The author finds Zed to be perceptually faster and more responsive than VSCode and its forks. It comes with a built-in Agent harness (a system for sending messages to LLMs and coordinating tool calls like file reading/writing) and can directly run external CLI tools like Claude Code through ACP (Agent Client Protocol).
- While Zed's native integration limits Gemini 2.5's context window to 200k tokens, using OpenRouter integration allows leveraging the full 1M token context. This is why the author prefers OpenRouter API integration over Zed's native paid plan.
- OpenRouter charges a 5.5% fee (updated after comments on HN). To protect privacy, the author did not consent to using input/output data for product improvement (a 1% discount is offered for consent) and enabled the 'Zero Data Retention (ZDR) Endpoints Only' option in Workspace settings.
- If you prefer CLI tools over Zed, the OpenCode + OpenRouter combination is also an alternative. The author mentions that OpenCode conveniently recognizes existing CLAUDE.md configuration files and allows connecting to various models (GLM 5.1, Kimi K2, etc.).
Evidence
- Regarding the debate over whether OpenRouter's 5.5% fee is worthwhile, some argue that access to dozens of models with a single API key, request-by-request cost tracking, multi-model comparison for the same request, and API key separation management justify the fee.
- There was strong opposition claiming that Claude Max subscription offers much better value in reality. One commenter stated they are getting $600 worth of usage for $100 with the ccusage tool, while another claimed they get $1,000+ of usage for $100 based on Opus 4.6 high thinking mode, arguing that switching to OpenRouter API alone would be more expensive.
- Some shared experiences of it being 2-3 times more expensive when trying a similar approach. Using a mix of Sonnet/Gemini/GPT models for development cost 2-3 times more than the Anthropic subscription, leading to the analysis that the subscription price significantly subsidizes the API cost.
- Reviews of Zed itself were mixed. Some found it a good VSCode alternative initially, but small inconveniences accumulated over time. Issues like high memory usage with TypeScript language servers and emoji rendering bugs on Linux were pointed out. The overall DX (developer experience) was rated at 85% of VSCode.
- Several real-world experiences shared using OpenCode + Kimi K2 as a backup for Claude Code. It was considered fast enough and sufficient for basic web app tasks, although not reaching Sonnet levels. The fact that OpenCode immediately recognizes CLAUDE.md files made environment switching easy.
- GitHub Copilot $40 plan was mentioned as another cost optimization alternative. It provides access to GPT-5 and Claude models, and since GitHub mediates the API, model performance degradation might be less than subscribing to Claude directly. Combining it with ChatGPT $20 subscription could create a good setup for $60.
- There was an experience of a major UK retail bank rejecting a transaction with OpenRouter and forcing a refund. This raised concerns that AI model access might be restricted as financial regulations tighten.
How to Apply
- If you are a Claude Code Max subscriber ($100/month) with a 'burst' usage pattern frequently hitting limits, consider canceling your subscription and switching to Zed ($10/month) + OpenRouter prepaid charging ($90) to avoid credit waste. However, first verify with tools like ccusage whether the API cost is lower than your current subscription.
- If you are concerned about privacy when using OpenRouter, refuse to consent to using input/output data for product improvement in your OpenRouter account settings and enable the 'Zero Data Retention (ZDR) Endpoints Only' option in Workspace Guardrail to minimize data exposure. Note that some models, like those from Alibaba Cloud (e.g., qwen/qwen3-plus), may become unavailable.
- If you repeatedly have to stop work due to exceeding the Claude Code limit, install OpenCode CLI and connect it to GLM 5.1 or Kimi K2 models on OpenRouter to set up a backup environment. OpenCode reads existing CLAUDE.md configuration files directly, allowing you to continue working without separate environment setup.
- If you find the 200k token context window limitation with Gemini 2.5 in Zed inconvenient, connect your OpenRouter API key to Zed instead of using Zed's native integration to utilize the full 1M token context.
Terminology
Agent harnessAn orchestration layer that sends messages to LLMs, coordinates tool calls like file reading/writing or command execution, and manages retries on failure. Claude Code is a representative example.
ACP (Agent Client Protocol)A protocol for communication between editors (like Zed) and external AI agents (like Claude Code). Supporting it allows running multiple agents directly within the editor.
ZDR (Zero Data Retention)An option to not store API request/response data on the server. It's a setting to protect code or sensitive information from being included in model training or logs.
Context WindowThe maximum length of text an LLM can read in a single request. A larger window allows processing longer codebases or conversation histories at once.
ccusageA CLI tool that measures actual Claude Code API usage in terms of tokens and cost. It's used to check how much usage you're getting for your subscription fee.
burst patternA usage pattern where you don't use it evenly throughout the day but instead concentrate usage at specific times. It's less efficient with subscription-based pricing that assumes even distribution.