Comet AI browser can get prompt injected from any site, drain your bank account
TL;DR Highlight
Brave's AI browser Comet is vulnerable to prompt injection when reading web pages, enabling malicious sites to hijack the LLM to access emails, initiate payments, and perform other sensitive actions.
Who Should Read
Developers integrating LLM-based agents into products, or engineers designing security architecture for AI browsers and AI email clients.
Core Mechanics
- Brave browser's AI agent feature 'Comet' executes hidden malicious prompts found in web pages when summarizing or performing tasks — a prompt injection vulnerability.
- Comet has broad permissions including cross-tab data access, email reading, and form filling, allowing an attacker to scan user emails or attempt payments from a single web page.
- Major players like Google, OpenAI, and Anthropic run similar features in isolated VMs without cookies, while Comet operates directly on the user's actual browser session — fundamentally unsafe.
- Brave acknowledged the vulnerability in a blog post but proposed 'model alignment to detect dangerous actions' — the community criticized this as meaningless given that models are immediately jailbroken in practice.
- Key concept: When an LLM 'reads' external data via tools, it's effectively allowing 'writes' to the context window. If it can read untrusted sources, those sources can manipulate the LLM's behavior.
- At USENIX Security, it was confirmed that no one yet knows how to fundamentally prevent prompt injection in multi-turn/agent environments. It remains an unsolved problem in academia.
- Similar vulnerabilities were found in AI email clients (Shortwave, etc.), and the 'Month of AI Bugs' project continues collecting similar cases.
- A user tested Comet by saying 'buy me a guitar on Amazon' — it added 3 cheap no-brand guitars to the cart without any confirmation. Fortunately it didn't complete the purchase, but it demonstrates reckless agent behavior.
Evidence
- Many commented that there's a reason Google/OpenAI/Anthropic haven't shipped this feature. They use cookieless isolated VMs for web browsing, while Comet directly exposes the user session — consensus was it's 'fundamentally unsafe.'
- The framing that 'every read action by an LLM tool is a write to the context window' gained strong agreement. The explanation that being able to read untrusted sources is itself an attack vector became a frequently cited core principle of agent security.
- Some argued agentic AI should only be used for easily reversible tasks (code writing/editing via git) — using it for irreversible actions like web browsing, payments, and email is reckless.
- Brave's proposed mitigations ('browser distinguishes user instructions from website content,' 'model verifies alignment with user intent') were strongly criticized as ineffective given that models get jailbroken immediately upon release.
- Someone noted the irony: decades of encrypting network layers one by one (even DNS), and now we're handing over all passwords and secrets via plaintext APIs.
How to Apply
- When implementing LLM agents that read external content (web pages, emails, documents), assume that reading itself is an attack vector. Isolate external inputs in separate contexts and always require user confirmation before invoking sensitive tools (payments, email sending).
- Minimize tool permissions granted to agents. A 'web page summary' feature doesn't need email access, form filling, or cross-tab data sharing. Separate permissions per task, and route irreversible actions (payments, messages) through a separate approval flow.
- When designing agent-based services, use 'rollback capability' as the criterion for automation scope. Code changes (git reset possible) are safe to automate, but payments, email sending, and account settings changes should be restricted from direct agent execution.
- If running AI agents in production, regularly check monthofaibugs.com to track similar vulnerability patterns and audit whether the same attacks are possible on your service.
Terminology
Related Papers
Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion
Git 기반 동기화와 Claude/Codex/Cursor 연동을 내장한 로컬 우선 마크다운 에디터로, AI 에이전트의 두 번째 뇌(LLM Wiki)로 활용할 수 있는 오픈소스 도구다.
The Unfireable Safety Kernel: Execution-Time AI Alignment for AI Agents and Other Escapable AI Systems
AI 에이전트가 자신의 안전장치를 우회할 수 없도록, 에이전트 프로세스 바깥에 수학적으로 증명된 강제 통제 게이트를 배치하는 아키텍처
RubyLLM: A Ruby framework for all major AI providers
OpenAI, Claude, Gemini 등 주요 AI 프로바이더를 단일 인터페이스로 통합한 Ruby 프레임워크로, Rails 통합과 에이전트 기능까지 지원해 Ruby 개발자가 AI 기능을 빠르게 붙일 수 있다.
Qwen-AgentWorld: Language World Models for General Agents
Alibaba Qwen 팀이 AI 에이전트가 행동 결과를 미리 시뮬레이션할 수 있는 'Language World Model'을 공개했다. 에이전트 훈련과 실행 경로 검증에 새로운 패러다임을 제시하는 연구다.
SHERLOC: Structured Diagnostic Localization for Code Repair Agents
버그 위치만 알려주는 게 아니라 '왜, 어떻게 고쳐야 하는지'까지 진단 리포트를 생성해서 코드 수정 에이전트의 성능을 높이는 training-free 프레임워크
Show HN: peerd – AI agent harness that runs entirely in your browser
백엔드 서버 없이 Chrome/Firefox 확장 프로그램으로만 동작하는 AI 에이전트 실행 환경으로, 브라우저 탭을 직접 조작하고 WASM Linux VM까지 구동할 수 있어 프라이버시와 보안을 동시에 챙길 수 있다.