I wish Claude just knew how I work without me explaining - so I made something that quietly observes me, learns and teaches it. Open source
TL;DR Highlight
A Mac app that automatically creates Skills by observing your actual work instead of repeatedly entering the same context for each Claude Code session.
Who Should Read
Developers who use Claude Code or AI coding agents daily and are tired of explaining the same workflows in each session. Especially freelancers or startup developers with many repetitive workflows such as PR reviews and client onboarding.
Core Mechanics
- Operates as a Mac menu bar app, observing the screen to automatically convert work patterns into 'Skill' files and deliver them to Claude or other agents via MCP (Model Context Protocol).
- Offers two capture modes: 'Focus Record' (record once and answer a few questions to create a Skill) and 'Passive Discovery' (quietly observe in the background for a few days).
- Passive Discovery distinguishes work from noise using an activity classifier, connects related tasks across sessions with cross-session linking even if a session is interrupted, and synthesizes patterns into Skills after observing them at least three times.
- Processed through an 11-step local pipeline: screen capture → frame annotation with a local VLM (Qwen 2.5 via Ollama) → semantic embedding for grouping similar workflows → all executed on your device.
- Skill files are structured playbooks containing strategy, steps, guardrails, and writing tone, not just simple prompts, and their confidence score is updated and self-improved based on execution success/failure.
- Screenshots are deleted after processing, PII (Personally Identifiable Information) and API keys are automatically masked, stored with local encryption, and no telemetry is sent — data does not leave the device.
Evidence
- Operates through an 11-step local pipeline, with Qwen 2.5 (local VLM based on Ollama) annotating each frame with predictions of the current app, current action, and next action to improve accuracy.
- Real-world use has confirmed cases where Passive Discovery automatically discovered workflows that the user themselves were not aware of (e.g., Monday metrics routines, GitHub issue triage patterns).
- Each Skill must pass a lifecycle gate after being observed at least three times before it can be used by an agent, and the confidence score is updated with each execution.
How to Apply
- For tasks with clear procedures like a PR review process, recording once with Focus Record immediately creates a Skill, and Claude Code automatically references that Skill via MCP in subsequent sessions.
- For routines where you don't know if there's a pattern (issue triage, weekly reports, etc.), leaving Passive Discovery running in the background for a few days will automatically find patterns and create Skills.
- The generated Skill files can be immediately connected to other agent tools that support MCP, such as OpenClaw and Codex, allowing you to reuse the same context even when switching agents.
Code Example
# 1. Clone the repo and install
git clone https://github.com/sandroandric/AgentHandover
cd AgentHandover
# 2. Prepare a local VLM with Ollama
ollama pull qwen2.5
# 3. Connect to the MCP server from Claude Code (example claude_desktop_config.json)
{
"mcpServers": {
"agenthandover": {
"command": "agenthandover-mcp",
"args": []
}
}
}
# 4. Example flow for creating a Skill with Focus Record
# - Run the menu bar app → Click Record
# - Actually perform the "PR review" task
# - Answer a few questions after completion
# → skills/pr_review.skill.json is automatically generated
# 5. Claude Code session automatically references the Skill
# Claude loads the pr_review Skill via MCP and
# automatically applies the corresponding strategy/steps/guardrailsTerminology
Related Papers
Formal Verification Gates for AI Coding Loops
AI가 생성한 코드에서 보안 불변식(invariant)을 지키게 하려면 프롬프트 지시보다 타입 시스템 같은 구조적 제약이 훨씬 효과적이라는 주장과 구현 방법을 소개한다.
Learnings from 100K lines of Rust with AI (2025)
Azure RSL(분산 합의 라이브러리)을 Rust로 재구현하면서 AI 코딩 에이전트를 활용해 4주 만에 100K 라인을 작성한 경험담으로, Code Contracts와 Spec-Driven Development를 AI와 조합하는 실전 워크플로우를 공유한다.
A Methodology for Selecting and Composing Runtime Architecture Patterns for Production LLM Agents
LLM agent가 왜 터지는지 이름 붙이고, 어떤 아키텍처 패턴을 언제 써야 하는지 5단계로 정리한 실전 가이드
Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks
작은 로컬 LLM(8B)에 guardrails(구조적 안전망)를 씌워 멀티스텝 에이전트 작업 성공률을 53%에서 99%까지 올린 Python 프레임워크 Forge 공개. 모델 자체는 건드리지 않고 실행 환경을 강화하는 접근법이라 주목받고 있음.
Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised
2026년 5월 19일, npm 계정 하나가 탈취되어 22분 만에 637개 악성 버전이 배포됐고, echarts-for-react·size-sensor 등 월 수백만 다운로드 패키지들이 감염되어 AWS 자격증명·SSH 키·AI 코딩 에이전트까지 탈취하는 정교한 공급망 공격이 발생했다.
Code as Agent Harness
LLM 에이전트에서 코드를 단순 출력물이 아닌 추론·행동·환경 모델링의 실행 인프라로 재정의한 102페이지짜리 서베이