An AI Skill that transforms PDFs into clean Markdown using MinerU's VLM engine. Supports LaTeX formulas, tables, images, and batch async processing.
npx skills add https://github.com/nebutra/mineru-skill --skill mineruInstall this skill with the CLI and start using the SKILL.md workflow in your workspace.
An AI Skill that transforms PDF documents into clean Markdown using MinerU's VLM engine.
ไธญๆๆๆกฃ | English
A Skill is an AI capability package that extends your AI assistant's abilities. When you ask the AI to do something, it automatically:
You: ่งฃๆ่ฟไบ่็ ๆฐๅญฆ็้ข PDF ๅฐๆ็ Obsidian
AI: ๐ ๅ็ฐ 40 ไธช PDF ๆไปถ
โณ ๅผๅงๅนถ่ก่งฃๆ (5 workers)...
โ
1993ๅนด่็ ๆฐๅญฆ๏ผไธ๏ผ็้ข โ Markdown
โ
1994ๅนด่็ ๆฐๅญฆ๏ผไธ๏ผ็้ข โ Markdown
...
โ
ๅฎๆ๏ผๅทฒไฟๅญๅฐ Obsidian/่็ /ๆฐๅญฆไธ/
npx skills add Nebutra/MinerU-Skill
Supported: OpenCode, Claude Code, Codex, Cursor, 35+ more
# Clone to your skills directory
git clone https://github.com/Nebutra/MinerU-Skill.git ~/openclaw-skills/mineru/
# Set API token
export MINERU_TOKEN="your-token-here" # Get from https://mineru.net/user-center/api-token
# Install via clawhub CLI
clawhub install mineru
# Clone to AI skills folder
git clone https://github.com/Nebutra/MinerU-Skill.git ~/.claude/skills/mineru/
ๆ ./document.pdf ่งฃๆๆ Markdown
่งฃๆ ./papers/ ็ฎๅฝไธ็ๆๆ PDF๏ผ่พๅบๅฐ ./output/
ๆ่ฟไบ PDF ่งฃๆๅ็ดๆฅไฟๅญๅฐๆ็ Obsidian Vault
่งฃๆ 1987-2025 ๅนด่็ ๆฐๅญฆ็้ข๏ผไฟๅญๅฐ Obsidian/่็ /ๆฐๅญฆไธ/
็จ 10 ไธชๅนถๅ๏ผ่ทณ่ฟๅทฒๅค็็ๆไปถ
| Feature | Description |
|---|---|
| ๐ PDF Input | Local files, URLs, batch directories |
| ๐ Output | Markdown + JSON metadata + Images |
| ๐ข LaTeX | Math formulas preserved |
| ๐ Tables | Structure extraction |
| ๐ผ๏ธ Images | Auto-extracted to images/ |
| โก Async | 15x parallel uploads |
| ๐ Resume | Skip processed files |
| ๐ Obsidian | Direct vault output |
You can also use directly via CLI:
# Single file
python scripts/mineru_v2.py --file ./doc.pdf --output ./output/
# Batch with resume
python scripts/mineru_v2.py \
--dir ./pdfs/ \
--output ~/Obsidian/MyVault/ \
--workers 10 \
--resume
| Option | Description |
|---|---|
--dir PATH |
Input directory |
--file PATH |
Single file |
--output PATH |
Output directory |
--workers N |
Concurrency (default: 5) |
--resume |
Skip processed files |
--token TOKEN |
API token |
output/
โโโ document-name/
โ โโโ document-name.md # Main Markdown
โ โโโ images/ # Extracted images
โ โ โโโ image_0_0.png
โ โ โโโ ...
โ โโโ content.json # Metadata
โโโ ...
Test: 10 PDFs, ~15 pages each (MacBook Air M1)
| Configuration | Time | Speed |
|---|---|---|
| Sequential | 8.5 min | 1.2 files/min |
| Async (5 workers) | 3.2 min | 3.1 files/min |
| Async (15 workers) | 1.8 min | 5.6 files/min |
export MINERU_TOKEN="your-token-here"
Free Tier: 2000 pages/day, 200MB max file
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ USER REQUEST โ
โ "Parse these PDFs to Markdown" โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ AI ASSISTANT โ
โ โข Recognizes PDF parsing task โ
โ โข Activates MinerU skill โ
โ โข Reads SKILL.md for instructions โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ MINERU SKILL ENGINE โ
โ Scanner โโโบ Scheduler โโโบ Worker Pool (N workers) โ
โ โ โ
โ โผ โ
โ API: Get URL โโโบ Upload โโโบ Poll โโโบ Download โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ OUTPUT โ
โ Markdown + JSON + Images โโโบ Obsidian/Files โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
MIT License - see LICENSE
If this skill helps you, give it a โญ!
Made with โค๏ธ by Nebutra