Claude.ai unavailable and elevated errors on the API
TL;DR Highlight
Anthropic’s entire service suite—Claude.ai, the API, Claude Code—became inaccessible for 1 hour and 18 minutes (17:34–18:52 UTC), sparking outrage among enterprise users over reliability concerns.
Who Should Read
Developers integrating the Claude API or Claude Code into production services, and team leaders grappling with LLM service availability and multi-model strategies.
Core Mechanics
- The outage began at 17:34 UTC on April 28, 2026, and was resolved at 18:52 UTC, lasting a total of 1 hour and 18 minutes. Affected services included claude.ai, Claude Console (platform.claude.com), Claude API (api.anthropic.com), Claude Code, Claude Cowork, and Claude for Government—essentially the entire service portfolio.
- The root cause was identified as an issue related to authentication. A surge in authentication errors occurred in API requests and Claude Code login paths, and claude.ai itself became inaccessible.
- Anthropic announced the investigation at 17:41 UTC, identified the problem at 17:51 UTC, reported work in progress at 18:33 UTC, transitioned to a monitoring phase at 18:59 UTC, and declared final resolution at 19:15 UTC, updating the status page throughout.
- Data shared from status.claude.com indicated that Claude’s uptime had fallen to the ‘one nine’ level—just over 90%—in the last 90 days. This level is widely considered unacceptable for production environments.
- A user from an organization spending over $200,000 monthly on the enterprise tier reported frequent outages in recent months and poor support, leading to anger from leadership. They stated that a ‘one nine’ level of reliability is unacceptable given the cost.
Evidence
- "A user spending over $200,000 monthly on Anthropic’s enterprise tier lamented frequent outages and poor support in recent months, indicating escalating frustration at the executive level and potentially leading to contract re-evaluation."
How to Apply
- If you rely on the Claude API as a single point of failure in production, consider adding automatic fallback logic to alternative models like OpenAI (Codex) or Google (Gemini). This can ensure continued operation during outages like the one experienced.
- Organizations spending tens of thousands of dollars monthly on the Claude API should regularly monitor Anthropic’s status.claude.com and subscribe to email/SMS alerts. Integrating with PagerDuty or Slack webhooks can reduce response times.
- Teams heavily using Claude Code in their workflow should set up alternative coding agents like OpenAI Codex CLI in parallel. This allows work to continue even when Claude Code is unavailable due to authentication issues.
- For teams of around 10 people where AI coding tool costs are a concern or stability is paramount, consider renting GPUs to self-host open models like Qwen or DeepSeek. While initial setup is required, it offers direct control over downtime risk and potential long-term cost savings.
Terminology
Related Papers
MTG Bench: Testing how well LLMs can play Magic
카드 게임 MTG의 규칙 준수 능력으로 LLM의 복잡한 규칙 추론 능력을 측정하는 독창적인 벤치마크로, gpt-5.5가 95.4점으로 1위를 차지했다.
ALIGNBEAM : Inference-Time Alignment Transfer via Cross-Vocabulary Logit Mixing
도메인 파인튜닝으로 망가진 LLM 안전성을, 재학습 없이 추론 시점에 작은 안전 모델에서 빌려와 복구하는 방법.
The iPad was on Tailscale: a WebRTC debugging story
WebRTC 데이터 채널에서 iPad만 응답을 못 받는 희귀 버그를 추적한 결과, webrtc-rs의 하드코딩된 MTU 상수와 Tailscale의 IPv6 Fragment 패킷 드롭이 동시에 작용한 복합 버그였다는 2주간의 디버깅 실화.
Can LLMs Beat Classical Hyperparameter Optimization Algorithms?
LLM 기반 하이퍼파라미터 최적화 에이전트와 CMA-ES, TPE 같은 고전 알고리즘을 직접 비교한 연구로, LLM 단독으로는 고전 방법을 이기지 못하지만 두 방법을 합친 하이브리드 'Centaur'가 최고 성능을 낸다는 결론이 나왔다.
What the Eyes See, the LLMs Miss: Exploiting Human Perception for Adversarial Text Attacks
Bold, 하이라이트, 공백 배치 같은 타이포그래피 트릭으로 GPT-4o, Llama Guard 등 10개 콘텐츠 모더레이션 시스템을 99% 이상 우회할 수 있다.
Did Claude increase bugs in rsync?
rsync 프로젝트에 Claude AI가 도입된 이후 버그가 늘었다는 소셜 미디어 주장을 실제 데이터와 통계 분석으로 검증한 글로, 결론적으로 Claude 도입 후 릴리즈가 역사적 분포에서 유독 버그가 많다는 통계적 근거는 없었다.