Claude Code Dedicated Development Harness - Achieving High-Quality Development Through an Autonomous Plan→Work→Review Cycle
npx skills add https://github.com/Chachamaru127/claude-code-harness --skill uiInstale esta skill com a CLI e comece a usar o fluxo de trabalho SKILL.md em seu espaço de trabalho.
Plan. Work. Review. Ship.
Turn Claude Code into a disciplined development partner.
English | 日本語
Hokage line carries forward. Full Claude Code 2.1.99-2.1.110 + Opus 4.7 integration. Plugin manifest now matches the official
plugins-referenceschema.
Anthropic shipped Opus 4.7 with literal-instruction-following semantics, plus Claude Code 2.1.105 added the PreCompact hook and the monitors/monitors.json manifest. v4.2 makes Harness fit those changes precisely:
| Area | Before (v4.1) | After (v4.2) |
|---|---|---|
| Long-running workers | Could be silently compacted mid-task | PreCompact hook blocks compaction while a worker is active or Plans.md is dirty |
| Plugin validation | claude plugin validate rejected monitors/agents blocks |
Public-spec compliant: monitors/monitors.json + agents/ auto-discovery |
| Sync regression | harness sync could silently strip declared blocks (4 prior incidents) |
Two-layer guard: shell idempotency test + Go struct test for phantom fields |
| Long sessions | Default 5-minute prompt cache only | bash scripts/enable-1h-cache.sh opt-in to 1-hour TTL (CC v2.1.108) |
| Reviewer/Advisor effort | medium / high |
xhigh (CC v2.1.111, Opus 4.7) — sharper review, with high fallback for other models |
| Agent prompts | Worked on Opus 4.6 implicit semantics | Re-tuned for Opus 4.7 literal instruction following — explicit thresholds, schemas, command names |
| Guardrails (R01-R13) | Conformed to CC 2.1.98 contract | Re-conformed to CC 2.1.110 (PermissionRequest updatedInput, PreToolUse additionalContext, Bash bypass closures) — 17 new regression tests |
What you'll notice:
claude plugin validate runs clean on Harness for the first time since monitors was addedharness sync stops mysteriously deleting your monitors/agents blockUpdate with the usual flow:
/plugin update claude-code-harness
For detailed Before/After in Japanese, see CHANGELOG.md [4.2.0] entry.
Go-native engine. 25x faster hooks. Zero Node.js dependency.
Every tool call Claude makes passes through Harness hooks. In v3, each pass cost 40-60ms of bash + Node.js overhead — a subtle drag you feel across hundreds of calls per session. v4 replaces the entire stack with a single Go binary:
| v3 (bash + Node.js) | v4 "Hokage" (Go) | |
|---|---|---|
| PreToolUse | 40-60ms | 10ms |
| SessionStart | 500-800ms | 10-30ms |
| PostToolUse | 20-30ms | 10ms |
| Node.js | Required (18+) | Not needed |
What you'll notice:
npm install or Node.js version issues on setupJust update the plugin — no configuration changes needed:
/plugin update claude-code-harness
Claude Code is powerful. Harness turns that raw capability into a delivery loop that is easier to trust and harder to derail.
The 5 verb skills keep setup, plan, work, review, and release on one path. The Go-native guardrail engine protects execution with sub-10ms response, and validation can be rerun when you need proof.
What matters here is not the theoretical ceiling of Claude Code. It is what becomes the default operating model once you install a harness — and what happens when you pair it with harness-mem.
With harness-mem, Claude Harness is the only setup where the default path is planned, guarded, reviewed, rerunnable, and remembered across sessions.
Full notes and source links: docs/github-harness-plugin-benchmark.md
Supported baseline and latest verified snapshot: see Claude Code Compatibility.
xhigh effort and Opus 4.7 supportclaude-opus-4-7) recommended for full v4.2 benefit (literal instruction following, vision 2576px, xhigh effort)| You Are | Harness Helps You |
|---|---|
| Developer | Ship faster with built-in QA |
| Freelancer | Deliver review reports to clients |
| Indie Hacker | Move fast without breaking things |
| VibeCoder | Build apps with natural language |
| Team Lead | Enforce standards across projects |
# Start Claude Code in your project
claude
# Add the marketplace & install
/plugin marketplace add Chachamaru127/claude-code-harness
/plugin install claude-code-harness@claude-code-harness-marketplace
# Initialize your project
/harness-setup
That's it. Start with /harness-plan.
Harness now ships with English as the default setup language. Japanese remains
fully supported as an explicit opt-in:
i18n:
language: ja
You can also start a temporary Japanese setup with
CLAUDE_CODE_HARNESS_LANG=ja claude. The Japanese README is
README_ja.md.
Don't want to read all this? Just type:
/harness-work all
One command runs the full loop after plan approval. Plan → Parallel Implementation → Review → Commit.
⚠️ Experimental workflow: Once you approve the plan, Claude runs to completion. Validate the success/failure contract in docs/evidence/work-all.md before depending on it in production.
/harness-setup
Bootstraps project files, rules, and command surfaces so the rest of the loop runs against the same conventions.
/harness-plan
"I want a login form with email validation"
Harness creates Plans.md with clear acceptance criteria.
/harness-work # Auto-detect parallelism
/harness-work --parallel 5 # 5 workers simultaneously
Each worker implements, runs a preflight self-check, and waits for an independent review verdict before completion.
Harness is starting to adopt an advisor-style execution flow.
In plain terms, the executor keeps moving, and only asks for help when the decision is genuinely hard.
The first rollout is in harness-loop, because long-running execution benefits most from "ask only when needed."
Details: docs/advisor-strategy.md
/harness-review
| Perspective | Focus |
|---|---|
| Security | Vulnerabilities, injection, auth |
| Performance | Bottlenecks, memory, scaling |
| Quality | Patterns, naming, maintainability |
| Accessibility | WCAG compliance, screen readers |
/harness-release
Packages the verified result into CHANGELOG, tag, and release handoff steps after implementation and review are complete.
Harness v4 protects your codebase with a Go-native guardrail engine (go/internal/guardrail/) — 13 declarative rules (R01–R13), single binary with sub-10ms response:
| Rule | Protected | Action |
|---|---|---|
| R01 | sudo commands |
Deny |
| R02 | .git/, .env, secrets |
Deny write |
| R03 | Shell writes to protected files | Deny |
| R04 | Writes outside project | Ask |
| R05 | rm -rf |
Ask |
| R06 | git push --force |
Deny |
| R07–R09 | Mode-specific and secret-read guards | Context-aware |
| R10 | --no-verify, --no-gpg-sign |
Deny |
| R11 | git reset --hard main/master |
Deny |
| R12 | Direct push to main / master |
Warn |
| R13 | Protected file edits | Warn |
| Post | it.skip, assertion tampering |
Warning |
| Perm | git status, npm test |
Auto-allow |
Runtime differences between Claude Code hooks and Codex CLI gates are documented in docs/hardening-parity.md.
v4 unifies 42 skills into 5 verb skills. Start with the verbs first, then add Breezing, Codex, or 2-agent flows only when you need them.
/planIdeas → Plans.md |
/workParallel implementation |
/review4-angle code review |
/releaseTag + GitHub Release |
/setupProject init & config |
| Command | What It Does | Legacy Redirect |
|---|---|---|
/harness-plan |
Ideas → Plans.md |
/plan-with-agent, /planning |
/harness-work |
Parallel implementation | /work, /breezing, /impl |
/harness-work all |
Approved plan → implement → review → commit | /work all |
/harness-review |
4-perspective code review | /harness-review, /verify |
/harness-release |
CHANGELOG, tag, GitHub Release | /release-har, /handoff |
/harness-setup |
Initialize project | /harness-init, /setup |
/memory |
Manage SSOT files | — |
harness doctor --residue |
Detect stale references to deleted code | — |
claude-code-harness/
├── go/ # Go-native guardrail + hookhandler engine
│ ├── cmd/harness/ # CLI entry point (sync, doctor, validate)
│ ├── internal/ # guardrail / hookhandler / state / lifecycle / breezing
│ └── pkg/ # config / hookproto (public API)
├── bin/ # Compiled harness binaries (darwin-arm64/amd64, linux-amd64)
├── skills/ # 5 verb skills + extension skills (plan/execute/review/release/setup, etc.)
├── agents/ # 3 agents (worker / reviewer / scaffolder)
├── hooks/ # CC hook configuration (hooks.json)
├── scripts/ # Auxiliary shell scripts
└── templates/ # Generation templates
Run entire task lists with autonomous agent teams:
/harness-work breezing all # Plan review + parallel implementation
/harness-work breezing --no-discuss all # Skip plan review, go straight to coding
Phase 0 (Planning Discussion) runs by default — Planner analyzes task quality, Critic challenges the plan, then you approve before coding starts. 8+ tasks auto-split into manageable batches.
When harness-mem is running, Harness automatically records session events — what you worked on, what tools were used, and how the session ended. Next time you start a session, that context is available for retrieval.
.claude/state/memory-bridge-events.jsonl (no external dependency)No configuration needed — Harness detects harness-mem automatically.
Delegate implementation tasks to OpenAI Codex in parallel. Codex implements, self-reviews, and reports back.
/harness-work --codex implement these 5 API endpoints
/harness-review --codex # 4 perspectives + Codex second opinion
Setup: Install Codex CLI and configure API key. Or run
./scripts/setup-codex.sh --userto use Harness skills inside Codex CLI directly.
Use Cursor as PM, Claude Code as implementer. Plans.md syncs between both.
/harness-release handoff # Report to Cursor PM
/generate-slide # 3 visual patterns, quality scoring, auto-export
/generate-video # JSON Schema-driven pipeline with Remotion
Dependencies: Slides need
GOOGLE_AI_API_KEY. Video needs Remotion + ffmpeg.
Skill packs can teach a prompt. Harness also enforces behavior at runtime.
/harness-work all behavior.| Issue | Solution |
|---|---|
| Command not found | Run /harness-setup first |
harness-* commands missing on Windows |
Update or reinstall the plugin. Public command skills now ship as real directories, so core.symlinks=false no longer hides them. |
| Plugin not loading | Clear cache: rm -rf ~/.claude/plugins/cache/claude-code-harness-marketplace/ and restart |
| Hooks not working | Run bin/harness doctor to diagnose (Go binary, no Node.js needed) |
| Stale v3 references after migration | Run bin/harness doctor --residue — auto-detects leftover references to deleted code |
For more help, open an issue.
/plugin uninstall claude-code-harness
Project files (Plans.md, SSOT files) remain unchanged.
Harness leverages the latest Claude Code features automatically. Here are the ones you'll notice:
| What You Get | How It Works |
|---|---|
| Parallel safe writes | Worktree isolation lets multiple workers edit the same file |
| Smart effort scaling | Complex tasks auto-trigger ultrathink mode |
| Auto-escalation | 3 consecutive tool failures trigger recovery |
| LLM quality guards | Agent hooks run security and quality checks on every edit |
| Team monitoring | Breezing auto-detects when workers finish or idle |
| Model flexibility | Use any provider (Bedrock, Vertex, etc.) via modelOverrides |
Full technical list (19 features): docs/CLAUDE-feature-table.md
| Resource | Description |
|---|---|
| Changelog | Version history |
| Claude Code Compatibility | Requirements |
| Distribution Scope | Included vs compatibility vs development-only paths |
| Work All Evidence Pack | Success/failure verification contract |
| Cursor Integration | 2-Agent setup |
| Benchmark Rubric | Static vs executed evidence scoring |
| Positioning Notes | Public-facing differentiation language |
| Content Layout | Source docs vs generated outputs convention |
Issues and PRs welcome. See CONTRIBUTING.md.
MIT License — Free to use, modify, commercialize.