pixverse-ai-image-and-video-generator

Agent skill library for PixVerse CLI — helps AI agents (Claude Code, Cursor, Codex, etc.) generate videos and images through structured, composable workflows.

Installation
npx skills add https://github.com/pixverseai/skills --skill pixverse-ai-image-and-video-generator

Install this skill with the CLI and start using the SKILL.md workflow in your workspace.

Last updated 4/24/2026

PixVerse Skills

What is this?

PixVerse CLI is a UI-free version of pixverse.ai — all models, parameters, and capabilities from the website are available as CLI commands with structured JSON output.

This repository provides skill files that teach AI agents how to use those commands correctly: which flags to pass, which models support which parameters, how to chain commands into pipelines, and how to handle errors.

Quick Start

# Install the CLI
npm install -g pixverse

# Authenticate
pixverse auth login

# Create a video
pixverse create video --prompt "A cat astronaut floating in space" --json

PixVerse CLI uses the same credit system as the website and is available to subscribed users only. See subscription plans.
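
For agents that drive the CLI from a script, the Quick Start command can be wrapped as shown here. This is a minimal sketch, not part of the PixVerse tooling: the helper names `build_create_video_argv` and `create_video` and the injectable `runner` parameter are hypothetical. The only things taken from this README are the `pixverse create video --prompt ... --json` command and the fact that its output is JSON. The response fields are not listed here, so the sketch returns the parsed object without assuming any keys.

```python
import json
import subprocess
from typing import Any, Callable, Sequence

# Function that actually executes the command. It is injectable so the
# wrapper can be exercised without the CLI installed.
Runner = Callable[[Sequence[str]], str]


def _run_cli(argv: Sequence[str]) -> str:
    """Run a command and return its stdout; raises CalledProcessError on a non-zero exit."""
    completed = subprocess.run(argv, capture_output=True, text=True, check=True)
    return completed.stdout


def build_create_video_argv(prompt: str) -> list[str]:
    """Build the Quick Start command line; only flags shown in this README are used."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return ["pixverse", "create", "video", "--prompt", prompt, "--json"]


def create_video(prompt: str, runner: Runner = _run_cli) -> Any:
    """Run `pixverse create video` and return the parsed JSON output unchanged."""
    return json.loads(runner(build_create_video_argv(prompt)))
```

Calling `create_video("A cat astronaut floating in space")` requires the CLI to be installed and authenticated. Passing a fake `runner` lets the parsing path be tested offline.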

Skill Structure

skills/
  SKILL.md                          # Entry point — start here
  capabilities/                     # Individual command skills
    auth-and-account.md             #   Authentication & account management
    create-video.md                 #   Text-to-video, image-to-video, fusion
    create-and-edit-image.md        #   Text-to-image, image-to-image
    modify-video.md                 #   AI content editing (replace subjects, swap outfits, change backgrounds)
    motion-control.md               #   Character animation with motion reference video
    transition.md                   #   Keyframe transition animations
    post-process-video.md           #   Extend, upscale, speech, sound
    prompt-enhance.md               #   Prompt optimization for V6 video generation
    task-management.md              #   Poll and wait for generation tasks
    asset-management.md             #   List, download, upload, delete assets
    saved-folders.md                #   Organize assets into named folders
    template.md                     #   Browse and create from effect templates
    workspace.md                    #   Team workspace management
    mondo-poster-design.md          #   Mondo-style poster, book cover, album art design
  workflows/                        # Multi-step pipeline skills
    text-to-video-pipeline.md       #   End-to-end text-to-video
    image-to-video-pipeline.md      #   Animate an image into video
    text-to-image-to-video.md       #   Generate image then animate it
    image-editing-pipeline.md       #   Iterative image editing
    modify-video-pipeline.md        #   Modify video content then enhance
    motion-control-pipeline.md      #   Character animation end-to-end
    video-production.md             #   Full production (create + extend + audio + upscale)
    storyboard-to-video.md          #   Multi-shot storyboard → concatenated video
    batch-creation.md               #   Parallel batch generation
    mondo-poster-pipeline.md        #   End-to-end Mondo poster generation
    mondo-poster-to-video-pipeline.md #  Animate poster into cinematic video
  references/                       # Curated design knowledge
    mondo-poster/                   #   37 artist styles, composition, genre templates

Capabilities vs Workflows

  • Capabilities document a single command or command group — flags, models, parameter constraints, JSON output format, error codes.
  • Workflows compose multiple capabilities into end-to-end pipelines with step-by-step instructions.

Supported Models

Video Models

| Model | CLI value | Modes | Quality | Duration (s) | Aspect Ratio |
|---|---|---|---|---|---|
| PixVerse V6 | v6 (default) | Video, Transition (first/last frame), Extend | 360p, 540p, 720p, 1080p | 1-15 (any integer) | 16:9, 4:3, 1:1, 3:4, 9:16, 3:2, 2:3, 21:9 |
| PixVerse C1 | pixverse-c1 | Video, Transition (first/last frame), Reference | 360p, 540p, 720p, 1080p | 1-15 (any integer) | 16:9, 4:3, 1:1, 3:4, 9:16, 3:2, 2:3 |
| PixVerse v5.6 | v5.6 | Video, Transition, Reference, Extend, Motion Control | 360p, 540p, 720p, 1080p | 1-10 (any integer) | 16:9, 4:3, 1:1, 3:4, 9:16, 3:2, 2:3 |
| Sora 2 | sora-2 | Video | 720p | 4, 8, 12 | 16:9, 9:16 |
| Sora 2 Pro | sora-2-pro | Video | 720p, 1080p | 4, 8, 12 | 16:9, 9:16 |
| Veo 3.1 Standard | veo-3.1-standard | Video, Transition | 720p, 1080p | 4, 6, 8 | 16:9, 9:16 |
| Veo 3.1 Fast | veo-3.1-fast | Video, Transition | 720p, 1080p | 4, 6, 8 | 16:9, 9:16 |
| Grok Imagine | grok-imagine | Video | 480p, 720p | 1-15 | 16:9, 4:3, 1:1, 9:16, 3:4, 3:2, 2:3 |
| Seedance 2.0 Standard | seedance-2.0-standard | Video, Reference, Transition | 480p, 720p | 4-15 | 16:9, 4:3, 1:1, 3:4, 9:16, 21:9 |
| Seedance 2.0 Fast | seedance-2.0-fast | Video, Reference, Transition | 480p, 720p | 4-15 | 16:9, 4:3, 1:1, 3:4, 9:16, 21:9 |
| Kling O3 Pro | kling-o3-pro | Video, Reference, Transition | 720p | 3-15 | 16:9, 9:16, 1:1 |
| Kling O3 Standard | kling-o3-standard | Video, Reference, Transition | 720p | 3-15 | 16:9, 9:16, 1:1 |
| Kling 3.0 Pro | kling-3.0-pro | Video, Transition | 720p | 3-15 | 16:9, 9:16, 1:1 |
| Kling 3.0 Standard | kling-3.0-standard | Video, Transition | 720p | 3-15 | 16:9, 9:16, 1:1 |
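
The constraints in the table above lend themselves to a pre-flight check before a generation request is sent. The sketch below encodes three rows of the table as plain data and validates a request against them. The `VideoModel` type and the `check_video_request` function are illustrative names, not part of the PixVerse CLI; the remaining rows would be added the same way.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VideoModel:
    qualities: frozenset[str]
    durations: frozenset[int]
    aspect_ratios: frozenset[str]


# Three rows copied from the Video Models table; keys are the CLI values.
VIDEO_MODELS = {
    "v6": VideoModel(
        qualities=frozenset({"360p", "540p", "720p", "1080p"}),
        durations=frozenset(range(1, 16)),  # 1-15, any integer
        aspect_ratios=frozenset({"16:9", "4:3", "1:1", "3:4", "9:16", "3:2", "2:3", "21:9"}),
    ),
    "sora-2": VideoModel(
        qualities=frozenset({"720p"}),
        durations=frozenset({4, 8, 12}),
        aspect_ratios=frozenset({"16:9", "9:16"}),
    ),
    "kling-3.0-pro": VideoModel(
        qualities=frozenset({"720p"}),
        durations=frozenset(range(3, 16)),  # 3-15
        aspect_ratios=frozenset({"16:9", "9:16", "1:1"}),
    ),
}


def check_video_request(model: str, quality: str, duration: int, aspect_ratio: str) -> list[str]:
    """Return a list of problems; an empty list means the request fits the table."""
    spec = VIDEO_MODELS.get(model)
    if spec is None:
        return [f"unknown model: {model}"]
    problems = []
    if quality not in spec.qualities:
        problems.append(f"{model} does not support quality {quality}")
    if duration not in spec.durations:
        problems.append(f"{model} does not support duration {duration}")
    if aspect_ratio not in spec.aspect_ratios:
        problems.append(f"{model} does not support aspect ratio {aspect_ratio}")
    return problems
```

For example, `check_video_request("sora-2", "1080p", 5, "16:9")` reports two problems, because Sora 2 is listed at 720p only and with durations of 4, 8, or 12.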

Image Models

| Model | CLI value | Resolution | Aspect Ratio |
|---|---|---|---|
| Qwen Image | qwen-image | 720p, 1080p | 1:1, 16:9, 9:16, 4:3, 3:4, 5:4, 4:5, 3:2, 2:3, 21:9 |
| Seedream 5.0 Lite | seedream-5.0-lite | 1440p, 1800p | auto, 1:1, 16:9, 9:16, 4:3, 3:4, 5:4, 4:5, 3:2, 2:3, 21:9 |
| Seedream 4.5 | seedream-4.5 | 1440p, 2160p | auto, 1:1, 16:9, 9:16, 4:3, 3:4, 5:4, 4:5, 3:2, 2:3, 21:9 |
| Seedream 4.0 | seedream-4.0 | 1080p, 1440p, 2160p | auto, 1:1, 16:9, 9:16, 4:3, 3:4, 5:4, 4:5, 3:2, 2:3, 21:9 |
| Gemini 2.5 Flash (aka Nanobanana) | gemini-2.5-flash | 1080p | auto, 1:1, 16:9, 9:16, 4:3, 3:4, 5:4, 4:5, 3:2, 2:3, 21:9 |
| Gemini 3.0 (aka Nano Banana Pro) | gemini-3.0 | 1080p, 1440p, 2160p | auto, 1:1, 16:9, 9:16, 4:3, 3:4, 5:4, 4:5, 3:2, 2:3, 21:9 |
| Gemini 3.1 Flash (aka Nano Banana 2) | gemini-3.1-flash | 512p, 1080p, 1440p, 2160p | auto, 1:1, 16:9, 9:16, 4:3, 3:4, 5:4, 4:5, 3:2, 2:3, 21:9 |
| Kling Image O3 | kling-image-o3 | 1080p, 1440p, 2160p | 16:9, 9:16, 1:1, 4:3, 3:4, 3:2, 2:3, 21:9 |
| Kling Image V3 | kling-image-v3 | 1080p, 1440p | 16:9, 9:16, 1:1, 4:3, 3:4, 3:2, 2:3, 21:9 |
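
An agent often knows the output it needs and has to pick a model that can produce it. The sketch below shows that lookup for four rows of the Image Models table. `IMAGE_RESOLUTIONS` and `models_supporting` are hypothetical names, and only the resolution values are modeled.

```python
# Resolution values for four rows of the Image Models table, keyed by CLI value.
IMAGE_RESOLUTIONS = {
    "qwen-image": ("720p", "1080p"),
    "seedream-4.5": ("1440p", "2160p"),
    "gemini-3.1-flash": ("512p", "1080p", "1440p", "2160p"),
    "kling-image-v3": ("1080p", "1440p"),
}


def models_supporting(resolution: str) -> list[str]:
    """Return the CLI values of models that list the given resolution, sorted by name."""
    return sorted(name for name, sizes in IMAGE_RESOLUTIONS.items() if resolution in sizes)
```

With these four rows, `models_supporting("2160p")` returns `["gemini-3.1-flash", "seedream-4.5"]`, and a resolution that no row lists returns an empty list.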

For AI Agent Developers

These skills are designed to be loaded into agent context. Each skill file is self-contained with:

  • Decision trees — help the agent choose the right command
  • Flag tables — every parameter with allowed values and defaults
  • Model reference tables — per-model parameter constraints
  • JSON output schemas — exact response format for parsing
  • Exit codes — deterministic error handling
  • Examples — copy-paste-ready commands

Start by loading skills/SKILL.md as the entry point, then load specific capability or workflow skills as needed.
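
The loading order described above can be sketched as a small helper that concatenates the entry point with whichever skill files a task needs. It assumes the repository has been checked out locally with the `skills/` layout shown under Skill Structure. The function name `build_context` and the section-separator format are illustrative choices, not something the repository prescribes.

```python
from pathlib import Path


def build_context(skills_dir: Path, extra: list[str]) -> str:
    """Concatenate SKILL.md with the requested skill files, entry point first.

    `extra` holds paths relative to `skills_dir`, e.g. "capabilities/create-video.md".
    """
    parts = []
    for relative in ["SKILL.md", *extra]:
        path = skills_dir / relative
        if not path.is_file():
            raise FileNotFoundError(f"skill file not found: {path}")
        # Label each section so the agent can tell which file the text came from.
        parts.append(f"===== {relative} =====\n{path.read_text(encoding='utf-8')}")
    return "\n\n".join(parts)
```

A text-to-video task might call `build_context(Path("skills"), ["capabilities/create-video.md", "workflows/text-to-video-pipeline.md"])`, keeping unrelated skills out of the agent's context.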

Community Skills

Projects built on top of PixVerse CLI by the community:

| Project | Author | Description |
|---|---|---|
| pixverse-character-pipeline | @takamasa045 | Character-driven video production: one speaker image + YAML config → multi-language, multi-ratio talking-head videos with lip-sync, BGM, and Remotion rendering |
| pixverse-shotpack | @takamasa045 | Creative brief → video shot pipeline: transforms markdown briefs or YAML storyboards into organized, editor-ready AI-generated video assets |

Have a project built on PixVerse CLI? Open a PR to add it here.

Credits

Thanks to the creators whose work has contributed to the PixVerse ecosystem:

  • @takamasa045 — for building character pipeline and shotpack production tools on PixVerse CLI
  • @vista8 — for the Mondo poster design system whose prompt engineering and artist style library are adapted in pixverse:mondo-poster-design