An agentic development harness for Claude Code, Codex & Cursor: gated pipeline from spec to green checks.
npx skills add https://github.com/heliohq/ship --skill refactorقم بتثبيت هذه المهارة باستخدام واجهة سطر الأوامر (CLI) وابدأ في استخدام سير عمل SKILL.md في مساحة عملك.
An agentic development harness for Claude Code, Codex & Cursor: gated pipeline from spec to green checks.
Ship orchestrates end-to-end software development — planning, implementation, E2E tests, review, QA, simplify, and PR creation — with quality gates at every transition.

Ship is a harness, not a copilot. It doesn't help AI write code — it constrains AI to produce reliable results through mechanically enforced quality gates.
The problem Ship solves: AI coding agents are capable but unreliable. They skip tests, hallucinate about code they haven't read, review their own work and call it good, and declare victory without evidence. Ship makes these failure modes structurally impossible.
/plugin marketplace add heliohq/ship
/plugin install ship@heliohq
Fetch and follow instructions from https://raw.githubusercontent.com/heliohq/ship/refs/heads/main/.codex/INSTALL.md
Codex uses hooks instead of plugins. See .codex/INSTALL.md for setup.
/add-plugin ship
Or search for Ship in the Cursor plugin marketplace.
Open a fresh session and give it a task — for example, "plan out a user authentication system". Ship should kick in automatically.
/plugin update ship
Run /ship:auto and Ship handles the full pipeline. Or run individual phases when you only need one:
| Skill | Description |
|---|---|
/ship:auto |
Full pipeline: design → dev → E2E → review → QA → simplify → handoff |
/ship:setup |
Bootstrap repo infrastructure, generate AGENTS.md and safety rules |
/ship:design |
Adversarial spec + plan with peer challenge rounds |
/ship:dev |
Host implements, peer (Codex) cross-validates; parallel waves for file-independent stories |
/ship:e2e |
Codify the change's acceptance criteria as persistent E2E tests, detect or scaffold the framework, run them against the real app |
/ship:review |
Bug-focused diff review — no style nits |
/ship:qa |
Exploratory sweep against the running app, finds what codified tests missed |
/ship:handoff |
PR creation + CI fix loop until checks green |
/ship:refactor |
Four-lens scan, classify by risk, apply with verification |
/ship:learn |
Capture session mistakes into persistent learnings |
/ship:arch-design |
System design thinking — requirements, components, trade-offs, scaling |
/ship:write-docs |
Project documentation with frontmatter, lifecycle, and indexing |
/ship:visual-design |
DESIGN.md visual system for consistent UI generation |
Skills trigger automatically based on what you're doing. Session start injects a short Ship routing policy that reminds the agent to invoke the matching /ship:* skill before proceeding and to default to /ship:auto for end-to-end feature work.
See docs/skills.md for detailed guides.
Ship is built on ideas from:
/ship:visual-design.learnings concept that inspired /ship:learn's staged learning lifecycle/simplify — The built-in skill that inspired /ship:refactor's four-lens scan